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Technical Field ; 

The present invention relates generally to bacterial antigens and genes encoding the 
same. More particularly, the present invention pertains to the cloning, expression and 
characterization of the GapC plasmin-bindrng proteins from Streptococcus aysgalactiae, 
Streptococcus agalactiae, Streptococcus uberis, Streptococcus parauberis, and 
Streptococcus iniae, and the use of the same in vaccine compositions. 

Background 

Mastitis is an infection of the mammary gland usually caused by bacteria or fungus. 
The inflammatory response following infection results in decreased milk yield as well as 
quality, and causes major annual economic losses to the dairy industry. 

Among the bacterial species most commonly associated with mastitis are various 
species of the genus Streptococcus, including Streptococcus aureus, Streptococcus uberis 
(untypeable), Streptococcus agalactiae (Lancefield group B), Streptococcus aysgalactiae 
(Lancefield group C), Streptococcus zooepidemicus, and the Lancefield groups D, G, L and 
N streptococci. Some of those species are contagions (e.g. S. agalactiae), while others are 
considered environmental pathogens (e.g. S. aysgalactiae and S. uberis). 

The environmental pathogen S. uberis is responsible for about 20% of all clinical 
cases of mastitis (Bramley, A. J. andDodd, F.H. (1984) J; Dairy Res. 51:481-512; Bramley, 
A.J. (1987) Animal Health Nutrition 42:12-16; Watts, J.L. (1988) J. Dairy Sci. 71:1616- 
1624); it is the predominant organism isolated from mammary glands during the non- 
lactating period (Bramley, AJ. (1984) Br. Vet. J. I40_:328-335; Bramley and Dodd (1984) J. 
Dairy Res. 51:481-512; Oliver, S.P. (1988) Am. J. Vet. Res. 49:1789-1793). 

Mastitis resulting from infection with S. uberis is commonly subclinical, 
characterized by apparently normal milk with an increase in somatic cell counts due to the 
influx of leukocytes. The chemical composition of milk is changed due to suppression of 
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secretion with the transfer of sodium chloride and bicarbonate from blood to milk, causing a 
shift of pH to a more alkaline level. S. uberis mastitis may also take the form of an acute 
clinical condition, with obvious signs of disease such as clots or discoloration of the milk 
and swelling or hardness of the mammary gland. Some cases of the clinical disease can be 
5 severe and pyrexia may be present. For a review of the clinical manifestations of S. uberis 
mastitis, see, Bramley (1991) Mastitis: physiology or pathology, p. 3-9. In C. Burvenich, G. 
Vandeputte-van Messom, and A. W. Hill (ed.), New insights into the pathogenesis of 
mastitis. Rijksuniversiteit Gent, Belgium; and Schalm et al. (1971) The mastitis complex-A 
brief summary, p. 1-3. In Bovine Mastitis. Lea & Febiger, Philadelphia 

10 Conventional antibacterial control methods such as teat dipping and antibiotic 

therapy are effective in the control of many types of contagious mastitis, but the 
environmental organisms typically found in all dairy barns are often resistant to such 
measures. Vaccination is therefore an attractive strategy to prevent infections of the 
mammary glands, and has been shown to be beneficial in the case of some contagious 

15 mastitis pathogens. 

The literature is limited regarding vaccination studies with S. dysgalactiae and S. 
uberis, and variable results have been observed. In some cases, immunization has resulted 
in increased sensitivity to the specific organism and in other cases strain-specific protection 
has been obtained. 

20 For example, previous studies have shown that primary infection with S. uberis can 

considerably reduce the rate of infection following a second challenge with the same strain 
(Hill, A.W. (1988) Res. Vet ScL 44:386-387). Local vaccination with killed S. uberis 

protects the bovine mammary gland against intramammary challenge with the homologous 
strain (Finch et al. (1994) Jn/ect Jmmwn. 62:3599-3603). Similarly, subcutaneous 

25 vaccination with live S. uberis has been shown to cause a dramatic modification of the 
pathogenesis of mastitis with the same strain (Hill et al. (1994) FEMS Immunol Med. 
Microbiol 8:109-1 1 8). Animals vaccinated in this way shed fewer bacteria in their milk and 
many quarters remain free of infection. 

Nonetheless, vaccination with live or attenuated bacteria can pose risks to the 

30 recipient. Further, it is clear that conventional killed vaccines are in general largely 

ineffective against & uberis and S. agalactiae, either due to lack of protective antigens on in 
vzfro-grown cells or masking of these antigens by molecular mimicry. 
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The current lack of existing mastitis vaccines against S. agalactiae or the contagious 
streptococcus strains is due at least in part to a lack of knowledge regarding the virulence 
determinants and protective antigens produced by those organisms which are involved in 
invasion and protection of the mammary gland (Collins et al. (1988) J. Dairy Res. 55:25-32; 
Leigh et al. (1990) Res. Vet. Sci. 49: 85-87; Marshall et al. (1986) J. Dairy Res. 53: 507- 
514). 

S. dysgalactiae is known to bind several extracellular and plasma-derived proteins 
such as fibronectin, fibrinogen, collagen, alpha-n-macroglobulin, IgG, albumin and other 
compounds. The organism also produces hyaluronidase and fibrinolysin and is capable of 
adhering to and invading bovine mammary epithelial cells. However, the exact roles of the 
bacterial components responsible for these phenotypes inpathogenesis is not known. 

Similarly, the pathogenesis of S. uteris infection is poorly understood. 
Furthermore, the influence of S. uberis virulence factors on host defense mechanisms and 
mammary gland physiology is not well defined. Known virulence factors associated with S. 
uberis include a hyaluronic acid capsule (Hill, A.W. (1988) Res. Vet. Sci. 45:400-404), 
hyaluronidase (Schaufuss et al. (1989) Zentralbl. Bakteriol. Ser. A 271 :46-53), R-like 
protein (Groschup, M.H. and Timoney, J.F. (1993) Res. Vet. Sci. 54:124-126), and a 
cohemolysin, the CAMP factor, also known as UBERIS factor (Skalka, B. and Smola, J. 
(1981) Zentralbl Bakteriol. Ser. A 249:190-194), R-like protein, plasminogen activator and 
CAMP factor. However, very little is known of their roles in pathogenicity. 

The use of virulence determinants from Streptococcus as immunogenic agents has 
been proposed. For example, the CAMP factor of £ uberis has been shown to protect 
vertebrate subjects from infection by that organism (Jiang, U.S. Patent No. 5,863,543). 

The y antigen of the group B Streptococci strain A909 (ATCC No. 27591) is a 
component of the c protein marker complex, which additionally comprises an a and p 
subunit (Boyle, U.S. Patent No. 5,721,339). Subsets of serotype la, IL and virtually all 
serotype lb cells of group B streptococci, have been reported to express components of the c 
protein. Use of the y subunit as an immunogenic agent against infections by Lancefield 
Group B Streptococcus infectionhas been proposed. However, its use to prevent or treat 
30 bacterial infections in animals, including mastitis in cattle, has not been studied. 

The group A streptococcal M protein is considered to be one of the major virulence 
factors of this organism by virtue of its ability to impede attack by human phagocytes 
(Lancefield, R.C. (1962) J. Immunol. 89:307-313). The bacteria persist in the infected 
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tissue until antibodies are produced against the M molecule. Type-specific antibodies to the 
M protein are able to reverse the antiphagocytic effect of the molecule and allow efficient 
clearance of the invading organism. * ' - 

M proteins are one of the key virulence factors of Streptococcus pyogenes, due to 
5 their involvement in mediating resistance to phagocytosis (Kehoe, M.A. (1991) Vaccine 
9:797-806) and their ability to induce potentially harmful host immune responses via their 
superantigenicity and their capacity to induce host-cross-reactive antibody responses 
(Bisno, AX. (1991) New Engl X Med. 325:783-793; Froude et al (1989) Curr. Top. 
Microbiol Immunol 145 :5-26: Stollerman, G.H. (1991) Clin. Immunol Immunopathol 
10 61:131-142). 

However, obstacles exist to using intact M proteins as vaccines. The protein's 
opsonic epitopes are extremely type-specific, resulting in narrow, type-specific protection. 
Further, some M proteins appear to contain epitopes that cross react with tissues of the 
immunized subject, causing a harmful autoimmune response (See e.g.^ Dale, J.L. and 
15 Beached, G.H. (1982) J. Exp. Med 156:1165-1176; Dale, JX. andBeached, G.H. (1985) J. 
Exp. Med. 161:113-122; Baird, R.W., Bronze, M.S., Drabs, W., Hill, H.R., Veasey, L.G. 
and Dale, J.L. (1991) J. Immun. 146:3132-3137; Bronze, M.S. and Dale, J.L. (1993) J. 
Immun 151 :2820-2828; Cunningham, M.W. and Russell, S.M. (1983) Infect. Immun. 
42:531-538). 

20 Chimeric proteins containing three different fibronectin binding domains (FNBDs) 

derived from fibronectin binding proteins of S. dysgalactiae and Staphylococcus aureus 
have been expressed on the surface of Staph, carnosus cells. In the case of one of these 
proteins, intranasal immunizations with live recombinant Staph, carnosus cells expressing 
the chimeric protein on their surface resulted in an improved antibody response to a model 

25 immunogen present within the chimeric surface protein. 

A GapC plasmin binding protein from a strain of Group A Streptococcus has 
previously been identified and characterized, and its use in thrombolytic therapies has been 
described (Boyle, et al, U.S. Patent No. 5,237,050; Boyle, et al., U.S. Patent No. 
5,328,996). - 

30 However, until now, the protective capability of GapC has not been studied, nor 

have the GapC proteins of Streptococcus dysgalactiae, Streptococcus agalactiae, 
Streptococcus uteris, Streptococcus parauberis or Streptococcus iniae been isolated or 
characterized. 
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Summary of the Invention 

Accordingly, the present invention provides novel Streptococcus GapC proteins and 
vaccine compositions comprising the same. In one embodiment, the invention is directed to 
an isolated GapC protein selected from the group consisting of: 

(a) an isolated Streptococcus dysgalactiae GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 1A-1B (SEQ ID 
NO:4); 

(b) an isolated Streptococcus agalactiae GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 2A-2B (SEQ ID 
NO:6); 

(c) an isolated Streptococcus uberis GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 3A-3B (SEQ ID 
NO:8); 

(d) an isolated Streptococcus parauberis GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 4A-4B (SEQ ID 
NO: 10); 

(e) an isolated Streptococcus iniae GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 5A-5B (SEQ ID 
NO: 12); and 

(f) immunogenic fragments of (a), (b), (c), (d) and (e) comprising at least about 5 
amino acids. 

In another embodiment, the invention is directed to isolated polynucleotides 
comprising coding sequences for the above-described GapC proteins, or complements 
thereof, recombinant vectors comprising the polynucleotides, host cells comprising the 
recombinant vectors, and methods of recombinantly producing the GapC proteins. 

In yet other embodiments, the subject invention is directed to vaccine compositions 
comprising a pharmaceutically acceptable vehicle and a GapC protein as described above, 
methods of producing the vaccine compositions, as well as methods of treating or 
preventing bacterial infections in a vertebrate subject comprising administering to the 
subject a therapeutically effective amount of the vaccine composition. The bacterial 
infection is, for example, a streptococcus infection and may cause mastitis. The vaccine 
compositions may further comprise an adjuvant. 
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In other embodiments, the invention is directed to antibodies directed against the 
isolated GapC proteins. The antibodies may be polyclonal or monoclonal. 

In still further embodiments, the invention is directed to methods of detecting 
Streptococcus antibodies in a biological sample, comprising: 
5 (a) reacting the biological sample with an isolated GapC protein under conditions 

which allow said Streptococcus antibodies, when present in the biological sample, to bind to 
the GapC protein to form an antibody/antigen complex; and 

(b) detecting the presence or absence of the complex, thereby detecting the presence 
or absence of Streptococcus antibodies in said sample. 
10 In another embodiment, the invention is directed to a method of detecting a GapC 

protein in a biological sample, comprising: 

(a) reacting the sample with antibodies directed against the GapC protein under 
conditions which allow the antibodies to bind to the GapC protein, when present in the 
sample, to form an antibody/antigen complex; and 
15 (b) detecting the presence or absence of the complex, thereby detecting the presence 

or absence of a GapC protein in the sample. 

Li another embodiment, the invention is directed to an immunodiagnostic test kit for 
detecting Streptococcus infection, the test kit comprising a GapC protein, or antibodies * 
directed against a GapC protein, and instructions for conducting the immunodiagnostic test 
20 These and other embodiments of the subject invention will readily occur to those of 

skill in the art in view of the disclosure herein. 

Brief Description of the Figures 

Figures 1 A- IB depict the isolated nucleotide sequences and deduced amino acid 
25 sequences of the gapC gene for S. dysgalactiae (SEQ ID NO:3 and SEQ ID NO:4). 

Figures 2A-2B depict the isolated nucleotide sequences and deduced amino acid 
sequences of the gapC gene for S. agalactiae (SEQ ID NO:5 and SEQ ID NO:6). 

Figures 3 A-3B depict the isolated nucleotide sequences and deduced amino acid 
sequences of the gapC gene for S. uteris (SEQ ID NO:7 and SEQ ID NO:8). 
30 Figures 4A-4B depict the isolated nucleotide sequences and deduced amino acid 

sequences of the gapC gene for S. parauberis (SEQ ID NO:9 and SEQ ID NO: 10). 

Figures 5 A-5B depict the isolated nucleotide sequences and deduced amino acid 
sequences of the gapC gene for S. iniae (SEQ ID NO: 11 and SEQ ID NO: 12). 
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Figures 6A-6E shows a DNA alignment created by PileUp and displayed by Pretty 
software (a component of the GCG Wisconsin Package, version 1 0, provided by the 
SeqWeb sequence analysis package, version 1.1, of the Canadian Bioinformatics Resource). 
The figure depicts the isolated nucleotide sequences of the gapC genes from S. dysgalactiae 
(DysGapC, Check 9344), S. agalactiae (AgalGapC. Check 2895), S. uberis (UberGapC, 
Check 5966), S. parauberis (PUberGapC, Check 9672), and S. iniae (IniaeGapC, Check 
990). The previously known sequences of S. equisimilis (SeqGapC, Check 5841 ), S. 
pyogenes (SpyGapC, Check 4037), and a bovine GAPDH protein (BovGapC, check 5059) 
are also included. The length and weight parameters were the same for all sequences (10 1 8 
and 1.00, respectively). The parameters used in the DNA sequence comparison were as 
follows: Plurality-2.00; Threshold-1; Ave Weight- 1.00; AveMateh~1.00; AvMisMatch- 
0.00; Symbol comparison table~pileupdna.cmp; CompCheck-6876; GapWeight-5; 
GapLengthWeight-1; PileUp MSF-1018; Type-N; Check--3804. In the figure, dashes ' 
represent identical nucleotides; dots represent gaps introduced by the software used to 
generate the alignment chart, and tildes represent regions not included in the overall 
alignment due to differences in the length of the gene sequences. 

Figures 7 A-7B shows an amino acid sequence alignment created by PileUp and 
displayed by Pretty (as above) that depicts the deduced amino acid sequences of the GapC 
proteins from S. dysgalactiae (DysGapC, Check 673 1) , S. agalactiae (AgalGapC, Check 
1229), S. uberis (UberGapC, Check 8229), S. parauberis (PUberGapC, Check 8889) and S. 
iniae (IniaeGapC, check 8785). The previously known sequences of S. equisimilis 
(SeqGapC, Check 8252), S. pyogenes (SpyGapC, Check 6626) and a bovine GAPDH 
protein (BovGapC, Check 8479) are also included In the figure, dashes represent identical 
amino acid residues; dots represent gaps introduced by the PileUp software, and tildes 
represent regions not included in the overall alignment due to differences in the length of 
the gene sequences. 

Figure 8 shows Kyte-Doolittle hydropathy plots (averaged over a window of 7), 
Emini surface probability plots, Karplus-Schulz chain flexibility plots, Jameson- Wolf 
antigenic index plots, and both Chou-Fasman and Gamier-Osguthorpe-Robson secondary 
structure plots for the GapC protein isolated from S. dysgalactiae. 

Figure 9 shows Kyte-Doolittle hydropathy plots (averaged over a window of 7), 
Emini surface probability plots, Karplus-Schulz chain flexibility plots, Jameson-Wolf 
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antigenic index plots, and both Chou-Fasman and Garnier-Osguthorpe-Robson secondary 
structure plots for the GapC protein isolated from S. agalactiae. 

Figure 1 0 shows Kyte-Doolittle hydropathy plots (averaged over a window of 7), 
Emini surface probability plots, Karplus-Schulz chain flexibility plots, Jameson-Wolf 
5 antigenic index plots, and both Chou-Fasman and Gainier-Osguthorpe-Robson secondary 
structure plots for the GapC protein isolated from S. uteris. 

Figure 1 1 shows Kyte-Doolittle hydropathy plots (averaged over a window of 7), 
Emini surface probability plots, Karplus-Schulz chain flexibility plots, Jameson- Wolf 
antigenic index plots, and both Chou-Fasman and Garnier-Osguthorpe-Robson secondary 
10 structure plots for the GapC protein isolated from S. parauberis. 

Figure 1 2 shows Kyte-Doolittle hydropathy plots (averaged over a window of 7), 
Emini surface probability plots, Karplus-Schulz chain flexibility plots, Jameson-Wolf 
antigenic index plots, and both Chou-Fasman and Garnier-Osguthorpe-Robson secondary 
structure plots for the GapC protein isolated from S. iniae. 
15 Figure 13 is a diagrammatic representation of the Chou-Fasman secondary structure 

plots for the GapC protein isolated from S. dysgal 

Figure 1 4 is a diagrammatic representation of the Chou-Fasman secondary structure 
plots for the GapC protein isolated from S. agalactiae. 

Figure 15 is a diagrammatic representation of the Chou-Fasman secondary structure 
20 plots for the GapC protein isolated from S. uberis. 

Figure 16 is a diagrammatic representation of the Chou-Fasman secondary structure 
plots for the GapC protein isolated from S. parauberis. 

Figure 17 is a diagrammatic representation of the Chou-Fasman secondary structure 
plots for the GapC protein isolated from S. iniae. 
25 Figure 1 8 shows the results of SDS-polyacrylamide gel electrophoresis of 

recombinant S. dysgalactiae GapC produced in E. coli DE3. Lane 1, molecular weight 
markers (20.5 - 103 kDa; BioRad, Emeryville, CA); lane 2, soluble recombinant S. * 
dysgalactiae GapC purified by Ni-NTA affinity chromatography. Numbers on the left of 
the figure indicate the positions of the molecular weight markers (in kDa). 
30 Figure 19 is a histograph which compares the enzymatic activity of bovine plasmin 

bound to intact S. dysgalactiae cells; bovine plasmin bound to affinity-purified recombinant 
S. dysgalactiae GapC protein; intact S. dysgalactiae cells alone; affinity-purified 
recombinant S. dysgalactiae GapC protein alone; and bovine plasmin alone. Activity was 
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determined by an increase in the absorbance at 405nm following release of paranitroanalide 
from the synthetic substrate chromozine-PL (Roche Diagnostics, Laval, Quebec, Canada). 
The data represents me mean of three individual assays. 

Figure 20 compares the change in the percentage of udder quarters infected with S. 
dysgalactiae over a 7 day period among three experimental groups: (1) unvaccinated control 
animals; (2) animals vaccinated with the Mig Fc binding protein; and (3) animals vaccinated 
with GapC. Infection was defined as recovery of >500 cfu of the S. dysgalactiae per ml of 
millc secretions. 

Figure 21 plots the maximum number of S. dysgalactiae in any udder quarter against 
serum anti-GapC antibody titre (expressed as the reciprocal of the dilution showing activity 
above background levels) . Serum anti-GapC titers were shown to correlate with the 
maximum number of cfu/ml recovered from the mammary glands ® = 0.74, as determined 
using GraphPad Prism software, v. 2.01, from GraphPad Software, Inc., San Diego, CA). 

Figure 22 plots cumulative number of infected mammary quarters against serum 
antibody titer (again expressed as the reciprocal of the dilution showing activity above 
background levels). A strong correlation between the anti-GapC antibody serum antibody 
level and me total number of infected quarters was observed. 

Figure 23 illustrates the recovery of bacteria from GapC-inmiunized animals; data 
points are plotted as mean bacterial recoveries/ml milk vs. time (in days post-challenge). In 
the figure, diamonds (-♦-) represent non-vaccinated animals; squares (-■-) represent low 
titer animals (i.e., animals exhibiting the poorest response against GapC in terms of 
antibody titre), and triangles (-a-) represent high titer animals (i.e., the remaining animals). 

Figure 24 shows recovery of S. dysgalactiae from GapC-immunized animals, plotted 
as the percent of mammary quarters infected vs. time in days post-challenge. In the figure, 
the stippled bars represent non-vaccinated animals; the cross-hatched bars represent low- 
titer animals (i.e., animals exhibiting the poorest response against GapC in terms of 
antibody titre) and the unshaded bars represent high-titer animals (i.e., the remaining 
animals). 

Figure 25 depicts the observed inflammatory response to infection with S. 
dysgalactiae plotted as mean somatic cell counts (SCC) for each experimental group versus 
time in days post challenge. In the figure, diamonds (-♦-) represent unchallenged, 
unvaccinated quarters, squares (-■-) represent challenged, unvaccinated animals, triangles(- 
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a-) represent challenged, Mig-vaccinated animals, and x's (-X-) represent challenged, 
GapC-vaccinated animals. 

Figure 26 illustrates somatic cell counts per mammary quarter on day 1 post- 
challenge. In the figure, the bar represents the mean for each group. Squares (-■-) 
5 represent unvaccinated animals; triangles (-a-) represent GapC-vaccinated animals, and 
inverted triangles (- ▼ -) represent Mig-vaccinated animals. 

Figure 27 depicts the somatic cell counts of non-vaccinated, non-challenged and 
GapC-immunized, high-titer animals (i.e., the four animals exhibiting the highest antibody 
titres of the eight animals in the particular group) for seven days post challenge, plotted as 
10 the log 10 of the mean somatic cell count/ml milk against time in days post challenge. 
Diamonds (-♦-) represent unchallenged, unvaccinated animals, squares (-■-) represent 
challenged, unvaccinated animals, and triangles (-a-) represent challenged, GapC- 
vaccinated animals. 

15 Detailed Description of the Invention 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA 
technology, and immunology, which are within the skill of the art. Such techniques are 
explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular 

20 Cloning: A Laboratoiy Manual, Vols. I, II and m, Second Edition (1989); Perbal, B., A 
Practical Guide to Molecular Cloning (1984); the series, Methods In Enzymology (S. 
Colowick and N. Kaplan eds., Academic Press, Inc.); and Handbook of Experimental 
; Immunology, Vols. I-IV (D.M. Weir and C.C. Blackwell eds., 1986, Blackwell Scientific 
Publications). 

25 - 

The following amino acid abbreviations are used throughout the text: 
Alanine: Ala (A) Arginme: Arg (R) 

Asparagine: Asn (N) Aspartic acid: Asp (D) 

Cysteine: Cys (C) Glutamine: Gin (Q) 

30 Glutamic acid: Glu (E) Glycine: Gly (G) 

Histidine: His (H) Isoleucine: lie (I) 

Leucine: Leu (L) Lysine: Lys (K) 

Methionine: Met (M) Phenylalanine: Phe (F) 
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Proline: Pro (P) Serine: Ser (S) 

Threonine: Thr (T) Tryptophan: Trp (W) 



Tyrosine: Tyr (Y) Valine: Val (V) 



1. Definitions 

In describing the present invention, the following terms will be employed, and are 
intended to be defined as indicated below. 

It must be noted that, as used in this specification and the appended claims, the 
1 0 singular forms "a", "an" and "the" include plural referents unless the content clearly dictates 
otherwise. Thus, for example, reference to "a Streptococcus GapC protein" includes a 
mixture of two or more such proteins, and the like. 

The terms "GapC protein" and "GapC plasmin binding protein" (used 
interchangeably herein) or a nucleotide sequence encoding the same, intends a protein or a 
1 5 nucleotide sequence, respectively, which is derived from a GapC gene found in a variety of 
Streptococcus species, including, without limitation certain strains of group A streptococci 
(Lottenbery, R., et al., (1987) Infect. Immun. 55: 19 14-1 9 18). The nucleotide sequence of 
representative Streptococcus gapC genes, and the corresponding amino acid sequence of the 
GapC proteins encoded by these genes, are depicted in the Figures. In particular, Figures 1 
20 through 5 depict the isolated nucleotide sequences and isolated amino acid sequences of S. 
dysgalactiae (SEQ ID NO:3 and SEQ ID NO:4, respectively), S. agalactiae (SEQ ID NO:5 
and SEQ ID NO:6, respectively), S. uberis (SEQ ID NO:7 and SEQ ID NO:8, respectively), 
S. parauberis (SEQ ID NO:9 and SEQ ID NO: 10, respectively), and S. iniae (SEQ ID 
NO:l 1 and SEQ ID NO:12, respectively). However, a GapC protein as defined herein is not 
25 limited to the depicted sequences as subtypes of each of these Streptococcus species are 
known and variations in GapC proteins will occur between them. 

Representative gapC genes, derived from S. dysgalactiae, S. agalactiae, S. uberis, 
and S. parauberis, are found in the plasmids pET15bgapC (ATCC No:), pMF521c (ATCC 
No:), pMF521a (ATCC No:), pMF521d (ATCC No:), and pMF521e (ATCC No:), 
30 respectively. 

Furthermore, the derived protein or nucleotide sequences need not be physically 
derived from the gene described above, but may be generated in any manner, including for 
example, chemical synthesis, isolation (e.g., from S. dysgalactiae) or by recombinant 
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production, based on the information provided herein. Additionally, the term intends 
proteins having amino acid sequences substantially homologous (as. defined below) to 
contiguous amino acid sequences encoded by the genes^ which display immunological 
and/or plasmin-binding activity. 

Thus, the terms intend full-length, as well as immunogenic, truncated and partial 
sequences, and active analogs and precursor forms of the proteins. Also included in the 
term are nucleotide fragments of the gene that include at least about 8 contiguous base pairs, 
more preferably at least about 10-20 contiguous base pairs, and most preferably at least 
about 25 to 50, or more, contiguous base pairs of the gene, or any integers between these 
values. Such fragments are useful as probes and in diagnostic methods, discussed more 
fully below. 

The terms also include those forms possessing, as well as lacking, a signal sequence, 
if such is present, as well as the nucleic acid sequences coding therefore. Additionally, the 
term intends forms of the GapC proteins which lack a membrane anchor region, and nucleic 
acid sequences encoding proteins with such deletions. Such deletions may be desirable in 
systems that do not provide for secretion of the protein. Furthermore, the plasmin-binding 
domains of the proteins, may or may not be present. Thus, for example, if the GapC 
plasmin-binding protein will be used to purify plasmin, the plasmin-binding domain will 
generally be retained. If the protein is to be used in vaccine compositions, immunogenic 
epitopes which may or may not include the plasmin-binding domain, will be present. 

The terms also include proteins in neutral form or in the form of basic or acid 
addition salts depending on the mode of preparation. Such acid addition salts may involve 
free amino groups and basic salts may be formed with free carboxyls. Phatmaceutically 
acceptable basic and acid addition salts are discussed further below. In addition, the 
proteins may be modified by combination with other biological materials such as lipids 
(both those occurring naturally with the molecule or other lipids that do not destroy 
immunological activity) and saccharides, or by side chain modification, such as acetylation 
of amino groups, phosphorylation of hydroxyl side chains, oxidation of sulfhydryl groups, 
glycosylation of amino acid residues, as well as other modifications of the encoded primary 
sequence. 

The term therefore intends deletions, additions and substitutions to the sequence, so 
long as the polypeptide functions to produce an immunological response as defined herein. 
In this regard, particularly preferred substitutions will generally be conservative in nature, 
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i.e., those substitutions that take place within a family of amino acids. For example, amino 
acids are generally divided into four families: (1) acidic - aspartate and glutamate; (2) basic 
- lysine, argirrine, histidine; (3) non-polar - alanine, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar glycine, asparagine, 
5 glutamine, cystine, serine threonine, tyrosine. Phenylalanine, tryptophan, and tyrosine are 
sometimes classified as aromatic amino acids. For example, it is reasonably predictable that 
an isolated replacement of leucine with isoleucine or valine, or vice versa; an aspartate with 
a glutamate or vice versa; a threonine with a serine or vice versa; or a similar conservative 
replacement of an amino acid with a structurally related amino acid, will not have a major 
1 0 effect on the biological activity. Proteins having substantially the same amino acid 

sequence as the reference molecule, but possessing minor amino acid substitutions that do 
not substantially affect the immunogenicity and/or plasmin-binding affinity of the protein, 
are therefore within the definition of the reference polypeptide. 

For example, the polypeptide of interest may include up to about 5-1 0 conservative 
15 or non-conservative amino acid substitutions, or even up to about 15-25 or 20-50 

conservative or non-conservative amino acid substitutions, or any integer between these 
values, so long as the desired function of the molecule remains intact . "/ . 

In this regard, GapC proteins isolated from streptococci exhibit several variable 
regions in their amino acid sequences, located at amino acid positions 62 to 8 1 ; 1 02 to 1 12; 
20 165 to 172; 248 to 271; and 286 to 305. These regions, which in S. dysgalactiae, S. 
agalactiae, S. uteris, S. parauberis and S. iniae exhibit from 1 to 9 amino acid 
substitutions, are likely to be amenable to variation without substantially affecting 
immunogenic or enzymatic function. 

Similarly, substitutions occurring in the transmembrane binding domain, if present, 
25 and the signal sequence, if present, normally will not affect immunogenicity. One of skill in 
the art may readily determine other regions of the molecule of interest that can tolerate 
change by reference to the protein structure data shown in Figures 8-17 herein. 

The term "streptococcal GapC protein" intends a GapC plasmin-binding protein, as 
defined above, derived from a streptococcal species that produces the same, including, but 
30 not limited to S. dysgalactiae, S. agalactiae, S. uteris, S. parauberis, and S. iniae. For 
example, a "£ dysgalactiae GapC protein" is a GapC plasmin-binding protein as defined 
above, derived from S. dysgalactiae. Similarly, an "S. agalactiae GapC protein" intends a 
gapC binding protein derived from S. agalactiae. 
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"Wild type" or "native" proteins or polypeptides refer to proteins or polypeptides 
isolated from the source in which the proteins naturally occur. "Recombinant" polypeptides 
refer to polypeptides produced by recombinant DNA techniques; i.e., produced from cells 
transformed by an exogenous DNA construct encoding the desired polypeptide. "Synthetic" 
5 polypeptides are those prepared by chemical synthesis. 

An "isolated" protein or polypeptide is a protein or polypeptide molecule separate 
and discrete from the whole organism with which the molecule is found in nature; or a 
protein or polypeptide devoid, in whole or part, of sequences normally associated with it in 
nature; or a sequence, as it exists in nature, but having heterologous sequences (as defined 
10 below) in association therewith. 

The term "functionally equivalent" intends that the amino acid sequence of a GapC 
plasmin-binding protein is one that will elicit a substantially equivalent or enhanced 
immunological response, as defined above, as compared to the response elicited by a GapC 
plasmm-binding protein having identity with the reference GapC plasmin-binding protein, 
15 or an immunogenic portion thereof. 

The term "epitope" refers to the site on an antigen or hapten to which specific B cells 
and/or T cells respond. The term is also used interchangeably with "antigenic determinant" 
or "antigenic determinant site." Antibodies that recognize the same epitope can be 
identified in a simple immunoassay showing the ability of one antibody to block the binding 
20 of another antibody to a target antigen. 

The terms "immunogenic" protein or polypeptide refer to an amino acid sequence 
which elicits an immunological response as described above. An "immunogenic" protein or 
polypeptide, as used herein, includes the full-length sequence of the GapC plasmin-binding 
protein in question, with or without the signal sequence, membrane anchor domain and/or 
25 plasmin-binding domain, analogs thereof, or immunogenic fragments thereof By 

"immunogenic fragment" is meant a fragment of a GapC plasmin-binding protein which 
includes one or more epitopes and thus elicits the immunological response described above. 
Such fragments can be identified using any number of epitope mapping techniques, well 
known in the art. See, e.g., Epitope Mapping Protocols in Methods in Molecular Biology, 
30 Vol. 66 (Glenn E. Morris, Ed., 1996) Humana Press, Totowa, New Jersey. For example, 
linear epitopes may be determined by e.g., concurrently synthesizing large numbers of 
peptides on solid supports, the peptides corresponding to portions of the protein molecule, 
and reacting the peptides with antibodies while the peptides are still attached to the 
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supports. Such techniques are known in the art and described in, e.g., U.S. Patent No. 
4,708,871; Geysen et al. (1984) Proc Natl Acad. Set USA 81:3998-4002; Geysen et al. 
(1986) Molec. Immunol 23:709-715. Similarly, conformational epitopes are readily 
identified by determining spatial conformation of amino acids such as by, e.g., x-ray 
5 crystallography and 2-dimensional nuclear magnetic resonance. See, e.g., Epitope Mapping 
Protocols, supra. Antigenic regions of proteins can also be identified using standard 
antigenicity and hydropathy plots, such as those calculated using, e.g., the Omiga version 
1 .0 software program available from the Oxford Molecular Group. This computer program 
employs the Hopp/Woods method, Hopp et al., Proa Natl Acad. Sci USA (1981) 78:3824- 

1 0 3828 for determining antigenicity profiles, and the Kyte-Doolittle technique, Kyte et al., J. 
Mol Biol (1982) 157:105-132 for hydropathy plots. Figures 8 to 12 herein depict Kyte- 
Doolittle profiles for representative proteins encompassed by the invention. 

Immimogenic fragments, for purposes of the present invention, will usually include 
at least about 3 amino acids, preferably at least about 5 amino acids, more preferably at least 

15 about 10-15 amino acids, and most preferably 25 or more amino acids, of the parent GapC 
plasmin-binding-binding protein molecule. There is no critical upper limit to the length of 
the fragment, which may comprise nearly the fiill-length of the protein sequence, or even a 
fusion protein comprising two or more epitopes of GapC. 

An "immunogenic composition" is a composition that comprises an antigenic 

20 molecule where administration of the composition to a subject results in the development in 
the subject of a humoral and/or a cellular immune response to the antigenic molecule of 
interest. 

By "subunit vaccine composition" is meant a composition containing at least one 
immunogenic polypeptide, but not all antigens, derived from or homologous to an antigen 

25 from a pathogen of interest. Such a composition is substantially free of intact pathogen 
cells or particles, or the lysate of such cells or particles. Thus, a "subunit vaccine 
composition" is prepared from at least partially purified (preferably substantially purified) 
immunogenic polypeptides from the pathogen, or recombinant analogs thereof A subunit 
vaccine composition can comprise the subunit antigen or antigens of interest substantially 

30 free of other antigens or polypeptides from the pathogen. 

By "pharmaceutically acceptable" or "pharmacologically acceptable" is meant a 
material which is not biologically or otherwise undesirable, i.e., the material may be 
ad mini stered to an individual in a formulation or composition without causing any 
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undesirable biological effects or interacting in a deleterious manner with any of the 
components of the composition in which it is contained. 

An "immunological response" to a composition or vaccine is the development in the 
host of a cellular and/ or antibody-mediated immune response to the composition or vaccine 
5 of interest. Usually, an "immunological response" includes but is not limited to one or more 
of the following effects: the production of antibodies, B cells, helper T cells, suppressor T 
cells, and/or cytotoxic T cells and/or y8 T cells, directed specifically to an antigen or 
antigens included in the composition or vaccine of interest Preferably, the host will display 
either a therapeutic or protective immunological response such that resistance of the 

1 0 mammary gland to new infection will be enhanced and/or the clinical severity of the disease 
reduced. Such protection will be demonstrated by either a reduction or lack of symptoms 
normally displayed by an infected host and/or a quicker recovery time. 

By "nucleic acid immunization" is meant the introduction of a nucleic acid molecule 
encoding one or more selected antigens into a host cell, for the in vivo expression of an 

15 antigen, antigens, an epitope, or epitopes. The nucleic acid molecule can be introduced 

directly into a recipient subject, such as by injection, inhalation, oral, intranasal and mucosal 
administration, or the like, or can be introduced ex vivo, into cells which have been removed 
from the host. In the latter case, the transformed cells are reintroduced into the subject 
where an immune response can be mounted against the antigen encoded by the nucleic acid 

20 molecule. -;V: 
The term "treatment" as used herein refers to either (1 ) the prevention of infection or 
reinfection (prophylaxis), or (2) the reduction or elimination of symptoms of the disease of 
interest (therapy). 

By "mastitis" is meant an inflammation of the mammary gland in mammals, 
25 including in cows, ewes, goats, sows, mares, and the like, caused by the presence of S. 
uteris. The infection manifests itself by the infiltration of phagocytic cells in the gland. 
Generally, 4 clinical types of mastitis are recognized: (1) peracute, associated with swelling, 
heat, pain, and abnormal secretion in the gland and accompanied by fever and other signs of 
systemic disturbance, such as marked depression, rapid weak pulse, sunken eyes, weakness 
30 and complete anorexia; (2) acute, with changes in the gland similar to those above but 
where fever, anorexia and depression are slight to moderate; (3) subacute, where no 
systemic changes are displayed and the changes in the gland and its secretion are less 
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marked: and (4) subclinical, where the inflammatory reaction is detectable only by standard 
tests for mastitis. 

Standard tests for the detection of mastitis include but are not limited to, the 
California Mastitis Test, the Wisconsin Mastitis Test, the Nagase test, the electronic cell 
count and somatic cell counts used to detect a persistently high white blood cell content in 
milk. In general, a somatic cell count of about 300,000 to about 500,000 cells per ml or 
higher, in milk will indicate the presence of infection. Thus, a vaccine is considered 
effective in the treatment and/or prevention of mastitis when, for example, the somatic cell 
count in milk is retained below about 500,000 cells per ml. For a discussion of mastitis and 
the diagnosis thereof, see, e.g., The Merck Veterinary Manual: A Handbook of Diagnosis, 
Therapy, and Disease Prevention and Control for the Veterinarian, Merck and Co., 
Railway, New Jersey, 1991. 

By the terms-'vertebrate," "subject," and "vertebrate subject" are meant any member 
of the subphylum Chordata, including, without hmitation, mammals such as cattle, sheep, 
15 pigs, goats, horses, and humans; domestic animals such as dogs and cats; and birds, 

including domestic, wild and game birds such as cocks and hens including chickens, turkeys 
and other gallinaceous birds; and fish. The term does not denote a particular age. Thus, 
both adult and newborn animals, as well as fetuses, are intended to be covered. 

A "nucleic acid" molecule can include, but is not limited to, procaryotic sequences, 
eucaryotic mRNA, cDNA from eucaryotic mRNA, genomic DNA sequences from 
eucaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. The term also 
captures sequences that include any of Ihe known base analogs of DNA and UNA. 

An "isolated" nucleic acid molecule is a nucleic acid molecule separate and discrete 
from the whole organism with which the molecule is found in nature; or a nucleic acid 
molecule devoid, in whole or part, of sequences normally associated with it in nature; or a 
sequence, as it exists in nature, but having heterologous sequences (as defined below) in 
association therewith. The term "isolated" in the context of a polynucleotide intends that 
the polynucleotide is isolated from the chromosome with which it is normally associated, 
and is isolated from the complete genomic sequence in which it normally occurs. 

"Purified polynucleotide" refers to a polynucleotide of interest or fragment thereof 
which is essentially free, e.g., contains less than about 50%, preferably less than about 70%, 
and more preferably less man about 90%, of the protein with which the polynucleotide is 
naturally associated. Techniques for purifying polynucleotides of interest are well-known 
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in the art and include, for example, disruption of the cell containing the polynucleotide with 
a chaotropic agent and separation of the polynucleotide(s) and proteins by ion-exchange 
chromatography, affinity chromatography arid sedimentation according to density. 

A "coding sequence" or a "nucleotide sequence encoding" a particular protein, is a 
5 nucleotide sequence which is transcribed and translated into a polypeptide in vitro or in vivo 
when placed under the control of appropriate regulatory elements. The boundaries of the 
coding sequence are determined by a start codon at the 5* (amino) terminus and a translation 
stop codon at the 3 1 (carboxy) terminus. A coding sequence can include, but is not limited 
to, procaryotic sequences, cDNA from eucaryotic mRNA, genomic DNA sequences from 

10 eucaryotic (e.g., mammalian) DNA, and even synthetic DNA sequences. A transcription 

termination sequence will usually be located 3 ! to the coding sequence. A "complementary" 
sequence is one in which the nitrogenous base at a given nucleotide position is the 
complement of the nitrogenous base appearing at the same position in the reference 
sequence. To illustrate, the complement of adenosine is tyrosine, and vice versa; similarly, 

15 cytosine is complementary to guanine, and vice versa; hence, the complement of the 
reference sequence 5 f -ATGCTGA-3* would be 5'-TACGACT-3\ 

A "wild-type" or "native" sequence, as used herein, refers to polypeptide encoding 
sequences that are essentially as they are found in nature, e.g., the S. dysgalactiae GapC 
protein encoding sequences depicted in Figures 1A-1B (SEQ ID NO:4). 

20 "Recombinant" as used herein to describe a nucleic acid molecule means a 

polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue of its 
origin or manipulation: (1) is not associated with all or a portion of the polynucleotide with 
which it is associated in nature; and/or (2) is linked to a polynucleotide other than that to 
which it is linked in nature. The term "recombinant" as used with respect to a protein or 

25 polypeptide means a polypeptide produced by expression of a recombinant polynucleotide. 
"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and other such 
terms denoting procaryotic microorganisms or eucaryotic cell lines cultured as unicellular 
entities, are used interchangeably, and refer to cells which can be, or have been, used as 
recipients for recombinant vectors or other transfer DNA, and include the progeny of the 

30 original cell which has been transfected. It is understood that the progeny of a single 

parental cell may not necessarily be completely identical in morphology or in genomic or 
total DNA complement to the original parent, due to accidental or deliberate mutation. 
Progeny of the parental cell which are sufficiently similar to the parent to be characterized 
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by the relevant property, such as the presence of a nucleotide sequence encoding a desired 
peptide, are included in the progeny intended by this definition, and are covered by the 
above terms. 

"Homology" refers to the percent identity between two polynucleotide or two 
polypeptide moieties. Two DNA, or two polypeptide sequences are "substantially 
homologous" to each other when the sequences exhibit at least about 80%-85%, preferably 
at least about 90%, and most preferably at least about 95%-98% sequence identity over a 
defined length of the molecules. As used herein, substantially homologous also refers to 
sequences showing complete identity to the specified DNA or polypeptide sequence. 

In general, "identity" refers to an exact nucleotide-to-nucleotide or amino acid-to- 
amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. 
Percent identity can be determined by a direct comparison of the sequence information 
between two molecules by aligning the sequences, counting the exact number of matches 
between the two aligned sequences, dividing by the length of the shorter sequence, and 
multiplying the result by 100. Readily available computer programs can be used to aid in 
the analysis, such as ALIGN, Dayhoff, M.O. in Atlas of Protein Sequence and Structure 
M.O. Dayhoff ed., 5 Suppl. 3:353-358, National biomedical Research Foundation, 
Washington, DC, which adapts the local homology algorithm of Smith and Watennan 
(1981) Advances in Appl. Math. 2:482-489 for peptide analysis. Programs for detenrnning 
^ nucleotide sequence identity are available in the Wisconsin Sequence Analysis Package, 
Version 8 (available from Genetics Computer Group, Madison, WI) for example, the 
BESTFIT, FASTA and GAP programs, which also rely on the Smith and Waterman 
algorithm. These programs are readily utilized with the default parameters recommended 
by the manufacturer and described in the Wisconsin Sequence Analysis Package referred to 
above. For example, percent identity of ap^ 

sequence can be determined using the homology algorithm of Smith and Watennan with a 
default scoring table and a gap penalty of six nucleotide positions. 

Another method of establishing percent identity in the context of the present 
invention is to use the MPSRCH package of programs copyrighted by the University of 
Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by 
IntelliGenetics, Inc. (Mountain View, CA). From this suite of packages the Smith- 
Waterman algorithm can be employed where default parameters are used for the scoring 
table (for example, gap open penalty of 12, gap extension penalty of one, and a gap of six). 
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From the data generated the "Match" value reflects "sequence identity." Other suitable 
programs for calculating the percent identity or similarity between sequences are generally 
known in the art, for example, another alignment program is BLAST, used with default 
parameters. For example/ BLASTN and BLASTP can be used using the following default 
5 parameters: genetic code = standard; filter = none; strand = both; cutoff = 60; expect = 10; 
Matrix = BLOSUM62; Descriptions - 50 sequences; sort by = HIGH SCORE; Databases = 
non-redundant, GenBanlc + EMBL + DDBJ + PDB + GenBank CDS translations + Swiss 
protein + Spupdate + PIR. Details of these programs can be found at the following internet 
address: http ://www.ncbi.nhn.gov/cgi-bin/BLAST. 

10 Alternatively, homology can be determined by hybridization of polynucleotides 

under conditions which form stable duplexes between homologous regions, followed by 
digestion with single-stranded-specific nuclease(s), and size determination of the digested 
fragments. DNA sequences that are substantially homologous can be identified in a 
Southern hybridization experiment under, for example, stringent conditions, as defined for 

15 that particular system. Defining appropriate hybridization conditions is within the skill of 
the art. See, e.g., Sambrook et al., supra; DNA Cloning, supra; Nucleic Acid Hybridization, 
supra. 

By the term "degenerate variant"- is intended a polynucleotide containing changes in 
the nucleic acid sequence thereof, that encodes a polypeptide having the same amino acid 
20 sequence as the polypeptide encoded by the polynucleotide from which the degenerate 
variant is derived. 

Techniques for determining amino acid sequence "similarity" are well known in the 
art. In general, "similarity" means the exact amino acid to amino acid comparison of two or 
more polypeptides at the appropriate place, where amino acids are identical or possess 

25 similar chemical and/or physical properties such as charge or hydrophobicity. A so-termed 
"percent similarity" then can be determined between the compared polypeptide sequences. 
Techniques for determining nucleic acid and amino acid sequence identity also are well 
known in the art and include determining the nucleotide sequence of the mRNA for that 
gene (usually via a cDNA intermediate) and determining the amino acid sequence encoded 

30 thereby, and comparing this to a second amino acid sequence. In general, "identity" refers 
to an exact nucleotide to nucleotide or amino acid to amino acid correspondence of two 
polynucleotides or polypeptide sequences, respectively. 



-tSDOCID: <WO 0196381 A2J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 01/96381 



-21- 



PCT/CA01/00838 



10 



15 



A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
molecule in nature. Thus, when the heterologous region encodes a bacterial gene, the gene 
will usually be flanked by DNA that does not flank the bacterial gene in the genome of the 
source bacteria. Another example of the heterologous coding sequence is a construct where 
the coding sequence itself is not found in nature (e.g., synthetic sequences having codons 
different from the native gene). Allelic variation or naturally occurring mutational events 
do not give rise to a heterologous region of DNA, as used herein. 

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another DNA 
segment may be attached so as to bring about the replication of the attached segment. A 
vector is capable of transferring gene sequences to target cells (e.g., bacterial plasmid 
vectors, viral vectors, non-viral vectors, particulate carriers, and liposomes). 

Typically, the terms "vector construct," "expression vector," "gene expression 
vector," "gene delivery vector," "gene transfer vector," and "expression cassette" all refer to 
an assembly which is capable of directing the expression of a sequence or gene of interest. 
Thus, the terms include cloning and expression vehicles, as well as viral vectors. 

These assemblies include a promoter which is operably linked to the sequences or 
gene(s) of interest. Other control elements may be present as well. The expression 
cassettes described herein may be contained within a plasmid construct. In addition to the 
components of the expression cassette, the plasmid construct may also include a bacterial 
origin of replication, one or more selectable markers, a signal which allows the plasmid 
construct to exist as single-stranded DNA (e.g., a M13 origin of replication), a multiple 
cloning site, and a "mammalian" origin of replication .(eg., a SV40 or adenovirus origin of 
replication). 

DNA "control elements" refers collectively to transcription promoters, transcription 
enhancer elements, transcription termination sequences, polyadenylation sequences (located 
3' to the translation stop codon), sequences for optimization of initiation of translation 
(located 5' to the coding sequence), translation termination sequences, upstream regulatory 
domains, ribosome binding sites and the like, which collectively provide for the 
transcription and translation of a coding sequence in a host cell. See e.g., McCaughan et al. 
(1995) PNAS USA 92:5431-5435; Kochetov et al (1998) FEB S Letts. 440:351-355. Not all 
of these control sequences need always be present in a recombinant vector so long as the 
desired gene is capable of being transcribed and translated. 
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"Operably linked" refers to an arrangement of elements wherein the components so 
described are configured so as to perform their usual function. Thus, control elements 
operably linked to a coding sequence are capable of effecting the expression of the coding 
sequence. The control elements need not be contiguous with the coding sequence, so long 
5 as they function to direct the expression thereof Thus, for example, intervening 

untranslated yet transcribed sequences can be present between a promoter and the coding 
sequence and the promoter can still be considered "operably linked" to the coding sequence. 
Similarly, "control elements compatible with expression in a subject" are those which are 
capable of effecting the expression of the coding sequence in that subject. 
10 A control element, such as a promoter, "directs the transcription 11 of a coding 

sequence in a cell when RNA polymerase will bind the promoter and transcribe the coding 
sequence into mRNA, which is then translated into the polypeptide encoded by the coding 
sequence. 

A "host cell" is a cell which has been transformed, or is capable of transformation, 

15 by an exogenous nucleic acid molecule. 

A cell has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
procaiyotes and yeasts, for example, the exogenous DNA may be maintained on an 

20 episomal element, such as a plasmid. With respect to eucaryotic cells, a stably transformed 
cell is one in which the exogenous DNA has become integrated into the chromosome so that 
it is inherited by daughter cells through chromosome replication. This stability is 
demonstrated by the ability of the . eucaryotic cell to establish cell lines or clones comprised 
of a population of daughter cells containing the exogenous DNA. 

25 As used herein, a "biological sample" refers to a sample of tissue or fluid isolated 

from a subject, including but not limited to, for example, blood, plasma, serum, fecal 
matter, urine, bone marrow, bile, spinal fluid, lymph fluid, samples of the fikin 3 external 
secretions of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, 
blood cells, organs, biopsies and also samples of in vitro cell culture constituents including 

30 but not limited to conditioned media resulting from the growth of cells and tissues in culture 
medium, e.g., recombinant cells, and cell components. 

As used herein, the terms "label" and "detectable label" refer to a molecule capable 
of detection, including, but not limited to, radioactive isotopes, fluoresces, 
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chenuluminescers, enzymes, enzyme substrates, enzyme cofactors, enzyme inhibitors, 
chromophores, dyes, metal ions, metal sols, ligands (e.g., biotin or haptens) and the like. 
The term "fluorescer" refers to a substance or a portion thereof which is capable of 
exhibiting fluorescence in the detectable range. Particular examples of labels which may be 
5 used under the invention include fluorescein, rhodamine, dansyl, umbeffiferone, Texas red, 
luminol, NADPH and a-P-galactosidase. 

2. Modes of Carrying Out the Invention 

Before describing the present invention in detail, it is to be understood that this 
1 0 invention is not limited to particular formulations or process parameters as such may, of 
course, vary. It is also to be understood that the teiminology used herein is for the purpose 
of describing particular embodiments of the invention only, and is not intended to be 
limiting. 

Although a number of methods and materials similar or equivalent to those 
15 described herein can be used in the practice of the present invention, the preferred materials 

and methods are described herein. 

Central to the present invention is the discovery that the GapC protein is capable of 

eliciting an immune response in a vertebrate subject. In particular, the genes for the GapC 

protein mis', dysgalactiae, S. agalactiae, S. uberis, S. parauberis, and S. iniae have been 
20 isolated, sequenced and characterized, and the protein sequences for those genes have been 

deduced therefrom. The complete DNA sequences for those genes and the corresponding 

amino acid sequences are shown in Figures 1 through 5. 

As described in the examples, the full-length S. dysgalactiae gapC gene, depicted at 

nucleotide positions 1-1011, inclusive, of Figures 1 A-1B, encodes the rail-length S. 
25 dysgalactiae GapC protein of 336 amino acids, shown as amino acids 1-336, inclusive, of 

the same figure. S. dysgalactiae GapC has a predicted molecular weight of about 36 kDa. 

(calculated using the Peptide Sort program of the GCG Wisconsin Package, versionlO, 

provided by the SeqWeb sequence analysis package, version 1.1 of the Canadian 

Bioinformatics Resource). Similarly, the gapC genes isolated from S. agalactiae, S. uberis, 
30 S. parauberis and S. iniae are depicted in Figures 2 through 5; each encodes a full-length 

GapC protein also of 336 amino acids, each also having a predicted molecular weight of 

about 36 kDa. None of the full-length sequences appear to include a signal peptide or a 

membrane anchor region. 



\ 

JSDOC1D: <WO 0196381A2_I_> 



SUBSTITUTE SHEET (RULE 26) 



WO 01/96381 



-24- 



PCT7CA01/00838 



Figures 6 and 7 show an alignment of DNA and amino acid sequences, respectively, 
showing regions of homology and variability that exist among GapC proteins from various 
streptococci strains. In particular, several variable regions are located at amino acid 
positions 62 to 81; 102 to 112; 165 to 172; 248 to 271; and 286 to 305. Such variable 
regions are typically more amenable to change. Hence, amino acid changes in these 
regions, such as substitutions, additions and deletions, are likely tolerated. 

Figures 8 through 12 show plots of the following for each of the GapC proteins of 
the present invention: Kyte-Doolittle hydrophathy, averaged over a window of 7; surface 
probability according to Emini; chain flexibility according to Karplus-Schulz; antigenicity 
index according to Jameson- Wolf; secondary structure according to Garnier-Osguthorpe- 
Robson; secondary structure according to Chou-Fasman; and predicted glycosylation sites. 
Figures 13 through 17 show plots of secondary structure according to Chou-Fasman for 
each of the GapC proteins of the present invention. One of skill in the art can readily use 
the information presented in Figures 8 to 17 to determine immunogenic regions in the 
protein for use in vaccine compositions. 

S. dysgalactiae GapC plasmin-binding protein, immunogenic fragments thereof or 
chimeric proteins including the same, can be provided in subunit vaccine compositions. In 
addition to use in vaccine compositions, the proteins or antibodies thereto can be used as " 
diagnostic reagents to detect the presence of infection in a vertebrate subj ect Similarly, the 
genes encoding the proteins can be cloned and used to design probes to detect and isolate \ * 
homologous genes in other bacterial strains. For example, fragments comprising at least 
about 1 5-20 nucleotides, more preferably at least about 20-50 nucleotides, and most 
preferably about 60-100 nucleotides, or any integer between these values, will find use in 
these embodiments. 

The vaccine compositions of the present invention can be used to treat or prevent a 
wide variety of bacterial infections in vertebrate subjects. For example, vaccine 
compositions including GapC plasmin-binding proteins from S. dysgalactiae, S. uteris, S. 
parauberis, & iniae, and/or group B streptococci (GBS) such as £ agalactiae, can be used 
to treat streptococcal infections in vertebrate subjects that are caused by these or other 
species. In particular, S. uberis and S. agalactiae are common bacterial pathogens 
associated with mastitis in bovine, equine, ovine and goat species. Additionally, group B 
streptococci, such as S. agalactiae, are known to cause numerous other infections in 
vertebrates, including septicemia, meningitis, bacteremia, impetigo, arthritis, urinary tract 
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mfections, abscesses, spontaneous abortion etc. Hence, vaccine compositions containing 
streptococcal GapC plasmin-binding proteins will find use in treating and/or preventing a 
wide variety of streptococcal infections. 

Similarly, GapC plasrmn-binding proteins derived from other bacterial genera such 
5 as Staphylococcus, Mycobacterium, Escherichia, Pseudomonas, Nocardia, Pasteurella, 
Clostridium and Mycoplasma will find use for treating bacterial infections caused by 
species belonging to those genera. Thus, it is readily apparent that GapC plasroin-binding 
proteins can be used to treat and /or prevent a wide variety of bacterial infections in 
numerous species. 

10 The streptococcal GapC plasmm-bmding proteins of the present invention can be 

used in vaccine compositions either alone or in combination with other bacterial, fungal, 
viral or protozoal antigens. These antigens can be provided separately or even as fusion 
proteins comprising one or more epitopes of a GapC plasrnin-binding protein fused to one 
or more of these antigens. For example, other immunogenic proteins from S. uberis, such as 
1 5 the CAMP factor, hyaluronic acid capsule, hyaluronidase, R-like protein and plasminogen 
activator, can be adrninistered with the GapC protein. Additionally, immunogenic proteins 
from other organisms involved in mastitis, such as from the genera Staphylococcus, 
Corynebacterium, Pseudomonas, Nocardia, Clostridium, Mycobacterium, Mycoplasma, 
Pasteurella, Prototheca, other streptococci, coliform bacteria, as well as yeast, can be 
20 administered along with the GapC plasrrim-bmcimg proteins described herein to provide a 
broad spectrum of protection. Thus, for example, immunogenic proteins from 
Staphylococcus aureus, Str. agalactiae, Str. dysgalactiae, Str. zooepidemicus, 
Corynebacterium pyogenes, Pseudomonas aeruginosa, Nocardia asteroides, Clostridium 
perfringens, Escherichia coli, Enterobacter aerogenes and Klebsiella spp. can be provided 
25 along with the GapC plasmin-binding proteins of the present invention. 

Additionally, GapC proteins from different streptococcal species may be used 
together in the vaccine compositions of the present invention. In mis embodiment, the 
multiple GapC proteins may be provided as fusion proteins or as discrete antigens in the 
same or different vaccine compositions. 



30 



Productio n of GapC Plasmin-Rindine Proteins 

The above-described plasmin-binding proteins and active fragments, analogs and 
chimeric proteins derived from the same, can be produced by variety of methods. 
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Specifically, GapC plasmin-binding proteins can be isolated directly from bacteria which 
express the same. This is generally accomplished by first preparing a crude extract which 
lacks cellular components and several extraneous proteins. The desired proteins can then be 
further purified from the cell lysate fraction by, e.g., column chromatography, HPLC, 
5 immunoadsorbent techniques or other conventional methods well known in the art. 

More particularly, techniques for isolating GapC plasmin-binding proteins have 
been described. For example, the GapC protein of S. pyogenes was purified from a crude 
cell extract by precipitation with ammonium sulfate, followed by two cycles of 
chromatography through a Mono FPLC column, and single cycles through superose 12 
10 FPLC, and TSK-phenol HPLC columns (Pancholi, V. and Fischetti, VA (1992) JExptl 

Med 76:415-426). Another technique involves the use of a NAD + -agarose affinity column 
to purify GapC from lysed protoplasts of S. pyogenes strain 64/14 (Winram, SB and 
Lottenberg, R (1996) Microbiol 142:23 1 1-2320). 

Alternatively, the proteins can be recombinantly produced as described herein. As 
15 explained abo ve, these recombinant products can take the form of partial protein sequences, 
full-length sequences, precursor forms that include signal sequences, mature forms without 
signals, or even fusion proteins (e.g., with an appropriate leader for the xecombinant host, or 
with another subunit antigen sequence for Streptococcus or another pathogen). 

In one embodiment of the present invention, the GapC proteins are fused to a 
20 histidine tag, produced by recombinant means, and are then purified from a cell lysate ^ 
fraction using affinity chromatography. See Example 1 A-E, infra. 

The GapC plasmin-binding proteins of the present invention can be isolated based 
on the ability of the protein products to bind plasmin, using plasmin-binding assays as 
described below. See, e.g., the method described in section F.3. of Example 1, infra. Thus, 
25 gene libraries can be constructed and the resulting clones used to transform an appropriate 
host cell. Colonies can be pooled and screened for clones having plasmin-binding activity. 
Colonies can also be screened using polyclonal serum or monoclonal antibodies to the 
plasmin-binding protein. 

Alternatively, once the amino acid sequences are determined, oligonucleotide probes 
30 which contain the codons for a portion of the determined amino acid sequences can be 
prepared and used to screen genomic or cDNA libraries for genes encoding the subject 
proteins. The basic strategies for preparing oligonucleotide probes and DNA libraries, as 
well as their screening by nucleic acid hybridization, are well known to those of ordinary 
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skill in the art. See, e.g., DNA Cloning: Vol. II, supra; Nucleic Acid Hybridization, supra; 
Oligonucleotide Synthesis, supra; Sambrook et al., supra. Once a clone from the screened 
library has been identified by positive hybridization, it can be confirmed by restriction 
enzyme analysis and DNA sequencing that the particular library insert contains GapC 
5 plasmin-binding protein gene or a homolog thereof. The genes can then be further isolated 
using standard techniques and, if desired, PCR approaches or restriction enzymes employed 
to delete portions of the full-length sequence. 

Similarly, genes can be isolated directly from bacteria using known techniques, such 
as phenol extraction and the sequence further manipulated to produce any desired 
10 alterations. See, e.g., Sambrook et al., supra, for a description of techniques used to obtain 
and isolate DNA. 

Alternatively, DNA sequences encoding the proteins of interest can be prepared 
synthetically rather than cloned. The DNA sequences can be designed with the appropriate 
codons for the particular amino acid sequence. In general, one will select preferred codons 

15 for the intended host if the sequence will be used for expression. The complete sequence is 
assembled from overlapping oligonucleotides prepared by standard methods and assembled 
into a complete coding sequence. See, e.g., Edge (1981) Nature 292 :756: Nambair et al. 
(1984) Science 223:1299; Jay etal. (1984) J. Biol Chem. 259:6311. 

Once coding sequences for the desired proteins have been prepared or isolated, they 

20 can be cloned into any suitable vector or replicon. Numerous cloning vectors are known to 
those of skill in the art, and the selection of an appropriate cloning vector is a matter of 
choice. Examples of recombinant DNA vectors for cloning and host cells which they can 
transform include the bacteriophage X (£ coli), pBR322 (E. coli), pACYC177 (E. coli), 
pKT230 (gram-negative bacteria), pGVl 106 (gram-negative bacteria), pLAFRl 

25 (gram-negative bacteria), pME290 (non-E. coli gram-negative bacteria), pHY14 (E. coli and 
Bacillus subtilis), pBD9 {Bacillus), pD61 (Streptomyces), pUC6 (Streptomyces), YIp5 
(Saccharomyces), YCpl9 (Saccharomyces) and bovine papilloma virus (mammalian cells). 
See, Sambrook et aL, supra; DNA Cloning, supra; B. Perbal, supra. 

The gene can be placed under the control of a promoter, ribosome binding site (for 

30 bacterial expression) and, optionally, an operator (collectively referred to herein as "control" 
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA 
in the host cell transformed by a vector containing this expression construction. The coding 
sequence may or may not contain a signal peptide or leader sequence. If a signal sequence • 
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is included, it can either be the native, homologous sequence, or a heterologous sequence. 
For example, the signal sequence for S. dysgalactiae GapC plasmin-binding protein can be 
used for secretion thereof, as can a number of other signal sequences, well known in the art. 
Leader sequences can be removed by the host in post-translational processing. See, e.g., 
5 U.S. Patent Nos. 4,431,739; 4,425,437; 4,338,397. 

Other regulatory sequences may also be desirable which allow for regulation of 
expression of the protein sequences relative to the growth of the host cell. Regulatory 
sequences are known to those of skill in the art, and examples include those which cause the 
expression of a gene to be turned on or off in response to a chemical or physical stimulus, 
10 including the presence of a regulatory compound. Other types of regulatory elements may 
also be present in the vector, for example, enhancer sequences. 

The control sequences and other regulatory sequences may be ligated to the coding 
sequence prior to insertion into a vector, such as the cloning vectors described above. 
Alternatively, the coding sequence can be cloned directly into an expression vector which 
1 5 already contains the control sequences and an appropriate restriction site. 

In some cases it may be necessary to modify the coding sequence so that it may be 
attached to the control sequences with the appropriate orientation; i.e., to maintain the 
proper reading frame. It may also be desirable to produce mutants or analogs of the GapC 
plasmin-binding protein. Mutants or analogs may be prepared by the deletion of a portion 
20 of the sequence encoding the protein, by insertion of a sequence, and/or by substitution of , . 
one or more nucleotides within the sequence. Techniques for modifying nucleotide 
sequences, such as site-directed mutagenesis, are described in, e.g., Sambrook et al., supra; 
DNA Cloning, supra; Nucleic Acid Hybridization, supra. 

The expression vector is then used to transform an appropriate host cell. A number 
25 of mammalian cell lines are known in the art and include immortalized cell lines available 
from the American Type Culture Collection (ATCC), such as, but not limited to, Chinese 
hamster ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney 
cells (COS), human hepatocellular carcinoma cells (e.g., Hep G2), Madin-Darby bovine 
kidney ("MDBK") cells, as well as others. Similarly, bacterial hosts such as E. coli, 
30 Bacillus subtilis, and Sfreptococcus spp., will find use with the present expression 

constructs. Yeast hosts useful in the present invention include inter alia, Saccharomyces 
cerevisiae, Candida albicans, Candida maltosa, Hansenula polymorpha, Kluyveromyces 
fragilis, Kluyveromyces lactis, Pichia guillerimondii, Pichia pastoris, Schizosaccharomyces 
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pombe and Yarrowia Hpolytica. Insect cells for use with baculovirus expression vectors 
include, inter alia, Aedes aegypti, Autographa californica, Bombyxmori, Drosophila 
melanogaster, Spodopterafrugiperda, and Trichoplusia ni. 

Depending on the expression system and host selected, the proteins of the present 
invention are produced by culturing host cells transformed by an expression vector 
described above under conditions whereby the protein of interest is expressed. The protein 
is then isolated from the host cells and purified. If the expression system secretes the 
protein into the growth media, the protein can be purified directly from the media. If the 
protein is not secreted, it is isolated from cell lysates. The selection of the appropriate 
growth conditions and recovery methods are within the skill of the art. 

The proteins of the present invention may also be produced by chemical synthesis 
such as solid phase peptide synthesis, using known amino acid sequences or amino acid 
sequences derived from the DNA sequence of the genes of interest. Such methods are 
known to those skilled in the art. See, e.g., J. M. Stewart and J. D. Young, Solid Phase 
Peptide Synthesis, 2nd Ed., Pierce Chemical Co., Rockford, IL (1984) and G. Barany and R. 
B. Merrifield, The peptides: Analysis, Synthesis, Biology, editors E. Gross and J. 
Meienhofer, Vol. 2, Academic Press, New York, (1980), pp. 3-254, for solid phase peptide 
synthesis techniques- and M. Bodansky, Principles of Peptide Synthesis, Springer-Verlag, 
Berlin (1984) and E. Gross and J. Meienhofer, Eds., The Peptides: Analysis, Synthesis, 
Biology, supra, Vol. 1, for classical solution synthesis. Chemical synthesis of peptides may 
be preferable if a small fragment of the antigen in question is capable of raising an 
immunological response in the subject of interest. 

The GapC plasmin-binding proteins of the present invention, or their fragments, can 
be used to produce antibodies, both polyclonal and monoclonal. If polyclonal antibodies are 
desired, a selected mammal, (e.g., mouse, rabbit, goat, horse, etc.) is immunized with an 
antigen of the present invention, or its fragment, or a mutated antigen. Serum from the 
immunized animal is collected and treated according to known procedures. See, e.g., 
Jurgens et al. (1985) J. Chrom. 348:363-370. If serum containing polyclonal antibodies is 
used, the polyclonal antibodies can be purified by irnmunoaffinity chromatography, using 
known procedures. 

Monoclonal antibodies to the GapC plasmin-binding proteins and to the fragments 
thereof, can also be readily produced by one skilled in the art. The general methodology for 
making monoclonal antibodies by using hybridoma technology is well known. Immortal 
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antibody-producing cell lines can be created by cell fusion, and also by other techniques 
such as direct transformation of B lymphocytes with oncogenic DNA, or transfection with 
Epstein-Barr virus. See, e.g., M. Schreier et al., Hybridoma Techniques (1980); 
Hammerling et al., Monoclonal Antibodies and T-cell Hybridomas (1981); Kennett et al., 
5 Monoclonal Antibodies (1980); see also U.S. Patent Nos. 4,341,761; 4,399,121; 4,427,783; 
4,444,887; 4,452,570; 4,466,917; 4,472,500, 4,491,632; and 4,493,890. Panels of 
monoclonal antibodies produced against the GapC plasmin-binding proteins, or fragments 
thereof, can be screened for various properties; i.e., for isotype, epitope, affinity, etc. 
Monoclonal antibodies are useful in purification, using immunoaffinity techniques, of the 
1 0 individual antigens which they are directed against. Both polyclonal and monoclonal 
antibodies can also be used for passive immunization or can be combined with subunit 
vaccine preparations to enhance the immune response. Polyclonal and monoclonal 
antibodies are also useful for diagnostic purposes. 

15 Vaccine Formulations and Administration 

The GapC plasmin-binding proteins of the present invention can be formulated into 
vaccine compositions, either alone or in combination with other antigens, for use in 
immunizing subjects as described below. Methods of preparing such formulations are 
described in, e.g., Remington 's Pharmaceutical Sciences, Mack Publishing Company, 

20 Easton, Pennsylvania, 18 Edition, 1990. Typically, the vaccines of the present invention are 
prepared as injectables, either as liquid solutions or suspensions. Solid forms suitable for 
solution in or suspension in liquid vehicles prior to injection may also be prepared. The 
preparation may also be emulsified or the active ingredient encapsulated in liposome 
vehicles. The active immunogenic ingredient is generally mixed with a compatible 

25 pharmaceutical vehicle, such as, for example, water, saline, dextrose, glycerol, ethanol, or 
the like, and combinations thereof. In addition, if desired, the vehicle may contain minor 
amounts of auxiliary substances such as wetting or emulsifying agents and pH buffering 
agents. 

Adjuvants which enhance the effectiveness of the Vaccine may also be added to the 
30 formulation. Adjuvants may include for example, muramyl dipeptides, avridine, aluminum 
hydroxide, dimethyldioctadecyl ammonium bromide (DDA), oils, oil-in-water emulsions, 
saponins, cytokines, and other substances known in the art. 
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The GapC plasmin-binding protein may be linked to a carrier in order to increase the 
immunogenicity thereof. Suitable carriers include large, slowly metabolized macro- 
molecules such as proteins, including serum albumins, keyhole limpet hemocyanin, 
immunoglobulin molecules, thyroglobulin, ovalbumin, and other proteins well known to 
those skilled in the art; polysaccharides, such as sepharose, agarose, cellulose, cellulose 
beads and the like; polymeric amino acids such as polyglutamic acid, polylysine, and the 
like; amino acid copolymers; and inactive virus particles. 

The GapC plasmin-binding proteins may be used in their native form or their 
functional group content may be modified by, for example, succinylation of lysine residues 
or reaction with Cys-thiolactone. A sulfhydryl group may also be incorporated into the 
carrier (or antigen) by, for example, reaction of amino functions with 2-iminotbiolane or the 
N-hydroxysuccinimide ester of 3-(4-dithiopyridyl propionate. Suitable carriers may also be 
modified to incorporate spacer arms (such as hexamethylene diamine or other Afunctional 
molecules of similar size) for attachment of peptides. 

Other suitable carriers for the GapC plasmin-binding proteins of the present 
invention include VP6 polypeptides of rotaviruses, or functional fragments thereof, as 
disclosed inU.S. Patent No. 5,071,651. Also useful is a fusion product of a viral protein 
and the subject immunogens made by methods disclosed in U.S. Patent No. 4,722,840. Still 
other suitable carriers include cells, such as lymphocytes, since presentation in this form 
mimics the natural mode of presentation in the subject, which gives rise to the immunized 
state. Alternatively, the proteins of the present invention may be coupled to erythrocytes, 
preferably the subject's own erythrocytes. Methods of coupling peptides to proteins or cells 
are known to those of skill in the art. J 

Furthermore, the GapC plasmin-binding proteins (or complexes thereof) may be 
formulated into vaccine compositions in either neutral or salt forms. Pharmaceutical^ 
acceptable salts include the acid addition salts (formed with the free amino groups of the ac- 
tive polypeptides) and which are formed with inorganic acids such as, for example, 
hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, 
and the like. Salts formed from free carboxyl groups may also be derived from inorganic 
bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, 
and such organic bases as isopropylamine, trimethylamine, 2-emylamino ethanol, histidine, 
procaine, and the like. 
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Vaccine fonnulations will contain a "therapeutically effective amount" of the active 
ingredient, that is, an amount capable of eliciting an immune response ih a subject to which 
the composition is administered. In the treatment and prevention of mastitis, for example, a 
"therapeutically effective amount" would preferably be an amount that enhances resistance 
5 of the mammary gland to new infection and/or reduces the clinical severity of the disease. 
Such protection will be demonstrated by either a reduction or lack of symptoms normally 
displayed by an infected host, a quicker recovery time and/or a lowered somatic cell count 
in milk from the infected quarter. For example, the ability of the composition to retain or 
bring the somatic cell count (SCC) in milk below about 500,000 cells per ml, the threshold 

1 0 value set by the International Dairy Federation, above which, animals are considered to 
have clinical mastitis, will be indicative of a therapeutic effect 

The exact amount is readily determined by one skilled in the art using standard tests. 
The GapC plasmin-binding protein concentration will typically range from about 1 % to 
about 95% (w/w) of the composition, or even higher or lower if appropriate. With the 

1 5 present vaccine formulations, 5 to 500 jig of active ingredient per ml of injected solution 

should be adequate to raise an immunological response when a dose of 1 to 3 ml per animal 
is administered. 

To immunize a subject, the vaccine is generally administered parenterally, usually ^ 
by intramuscular injection. Other modes of administration, however, such as subcutaneous, 

20 intraperitoneal and intravenous injection, are also acceptable. The quantity to be 

ad min istered depends on the animal to be treated, the capacity of the animal's immune 
system to synthesize antibodies, and the degree of protection desired. Effective dosages can 
be readily established by one of ordinary skill in the art through routine trials establishing 
dose response curves. The subject is immunized by administration of the vaccine in at least 

25 one dose, and preferably two doses. Moreover, the animal may be administered as many 
doses as is required to maintain a state of immunity to infection. 

Additional vaccine fonnulations which are suitable for other modes of 
administration include suppositories and, in some cases, aerosol, intranasal, oral 
formulations, and sustained release fonnulations. For suppositories, the vehicle 

30 composition will include traditional binders and carriers, such as, polyalkaline glycols, or 
triglycerides. Such suppositories may be formed from mixtures containing the active 
ingredient in the range of about 0.5% to about 10% (w/w), preferably about 1% to about 
2%. Oral vehicles include such normally employed excipients as, for example, 
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pharmaceutical grades of mannitol, lactose, starch, magnesium, stearate, sodium saccharin 
cellulose, magnesium carbonate, and the like. These oral vaccine compositions may be 
taken in the form of solutions, suspensions, tablets, pills, capsules, sustained release 
formulations, or powders, and contain from about 10% to about 95% of the active 
ingredient, preferably about 25% to about 70%. 

Intranasal formulations will usually include vehicles that neither cause irritation to 
the nasal mucosa nor significantly disturb ciliary function. Diluents such as water, aqueous 
saline or other known substances can be employed with the subject invention. The nasal 
formulations may also contain preservatives such as, but not limited to, chlorobutanol and 
benzalkonium chloride. A surfactant may be present to enhance absorption of the subject 
proteins by the nasal mucosa. 

Controlled or sustained release formulations are made by incorporating the protein 
into carriers or vehicles such as liposomes, nonresorbable impermeable polymers such as 
ethylenevinyl acetate copolymers and Hytrel® copolymers, sweUable polymers such as 
hydrogels, or resorbable polymers such as collagen and certain polyacids or polyesters such 
as those used to make resorbable sutures. The GapC plasmin-binding proteins can also be - 
delivered using implanted mini-pumps, well known in the art. 

The GapC plasrriin-binding proteins of the instant invention can also be 
administered via a carrier virus which expresses the same. Carrier viruses which will find 
use with me instant invention include but are not limited to the vaccinia and other pox 
viruses, adenovirus, and herpes virus. By way of example, vaccinia virus recombinants 
expressing the novel proteins can be constructed as follows. The DNA encoding the 
particular protein is first inserted into an appropriate vector so that it is adjacent to a 
vaccinia promoter and flanking vaccinia DNA sequences, such as the sequence encoding 
thymidine kinase (TK). This vector is then used to transfect cells which are simultaneously 
infected with vaccinia. Homologous recombination serves to insert the vaccinia promoter 
plus the gene encoding the instant protein into the viral genome. The resulting TK" 
recombinant can be selected by culruring the cells in the presence of 5-bromodeoxyuridine 
and picking viral plaques resistant thereto. 

An alternative route of admimstration involves gene therapy or nucleic acid 
immunization. Thus, nucleotide sequences (and accompanying regulatory elements) 
encoding the subject GapC plasrriin-binding proteins can be administered directly to a 
subject for in vivo translation thereof. Alternatively, gene transfer can be accomplished by 
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transfecting the subject's cells or tissues ex vivo and reintroducing the transformed material 
into the host. DNA can be directly introduced into the host organism, i.e., by injection (see 
International Publication No. WO/90/11092; and Wolff et al. (1990) Science 247:1465- 
1468). Liposome-mediated gene transfer can also be accomplished using known methods. 
See, e.g., Hazinski et al. (1991) Am. J. Respir. CellMol Biol 4:206-209; Brigham et al. 
(1989) Am. J. Med. Set 298:278-281; Canonico et al. (1991) Clin. Res. 39:219A; andNabel 
et al. (1990) Science 249:1285-1288, Targeting agents, such as antibodies directed against 
surface antigens expressed on specific cell types, can be covalently conjugated to the 
liposomal surface so that the nucleic acid can be delivered to specific tissues and cells 
susceptible to infection. 

Diagnostic Assays 

As explained above, the GapC plasmin-binding proteins of the present invention 
may also be used as diagnostics to detect the presence of reactive antibodies of . 
streptococcus, for example S. dysgalactiae, in a biological sample in order to determine the 
presence of streptococcus infection. For example, the presence of antibodies reactive with 
GapC plasmin-binding proteins can be detected using standard electrophoretic and 
immunodiagnostic techniques, including immunoassays such as competition, direct 
reaction, or sandwich type assays. Such assays include, but are not limited to, Western 
blots; agglutination tests; enzyme-labeled and mediated immunoassays, such as ELISAs; 
biotin/avidin type assays; radioimmunoassays; immunoelectrophoresis; 
immunoprecipitation, etc. The reactions generally include revealing labels such as 
fluorescent, chemiluminescent, radioactive, enzymatic labels or dye molecules, or other 
methods for detecting the, formation of a complex between the antigen and the antibody or 
antibodies reacted therewith. 

The aforementioned assays generally involve separation of unbound antibody in a 
liquid phase from a solid phase support to which antigen-antibody complexes are bound. 
Solid supports which can be used in the practice of the invention include substrates such as 
nitrocellulose (e.g., in membrane or microliter well form); polyvinylchloride (e.g., sheets or 
microtiter wells); polystyrene latex (e.g., beads or microliter plates); polyvinylidine 
fluoride; diazotized paper; nylon membranes; activated beads, magnetically responsive 
beads, and the like. 
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Typically, a solid support is first reacted with a solid phase component (e:g., one or 
more GapC plasmin-binding proteins) under suitable binding conditions such that the 
component is sufficiently immobiUzed to the support. Sometimes, immobilization of the 
antigen to the support can be enhanced by first coupling the antigen to a protein with better 
5 binding properties. Suitable coupling proteins include, but are not limited to, 

macromolecules such as serum albumins including bovine serum albumin (BSA), keyhole 
limpet hemocyanin, immunoglobulin molecules, thyroglobulin, ovalbumin, and other 
proteins well known to those skilled in the art. Other molecules that can be used to bind the 
antigens to the support include polysaccharides, polylactic acids, polyglycolic acids, 

1 0 polymeric amino acids, amino acid copolymers, and the like. Such molecules and methods 
of coupling these molecules to the antigens, are well known to those of ordinary skill in the 
art. See, e.g., Brinkley, M.A. Bioconjugate Chem. (1992) 3:2-13; Hashida et al., J. Appl. 
Biochem. (1984) 6:56-63; and Anjaneyulu and Staros, International J. of Peptide and 
Protein Res. (1987) 30:117-124. 

15 After reacting the solid support with the solid phase component, any non- 

immobilized solid-phase components are removed from the support by washing, and the 

support-bound component is men contacted with a biological sample suspected of 
containing ligand moieties (e.g., antibodies toward the immobilized antigens) under suitable 
binding conditions. After washing to remove any non-bound ligand, a secondary binder 
moiety is added under suitable binding conditions, wherein the secondary binder is capable 
of associating selectively with the bound ligand. The presence of the secondary binder can 
then be detected using techniques well known in the art. 

More particularly, an ELISA method can be used, wherein the wells of a microliter 
plate are coated with a GapC plasmin-binding protein. A biological sample containing or 
suspected of containing anti-GapC plasmin-binding protein immunoglobulin molecules is 
then added to the coated wells. After a period of incubation sufficient to allow antibody 
binding to the immobilized antigen, the plate(s) can be washed to remove unbound moieties 
and a detectably labeled secondary binding molecule added. The secondary binding 
molecule is allowed to react with any captured sample antibodies, the plate washed and the 
presence of the secondary binding molecule detected using methods well known in the art. 

Thus, in one particular embodiment, the presence of bound anti-GapC plasmin- 
binding antigen ligands from a biological sample can be readily detected using a secondary 
binder comprising an antibody directed against the antibody ligands. A number of anti- 
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r bovine immunoglobulin (Ig) molecules are known in the art which can be readily 
conjugated to a detectable enzyme label, such as horseradish peroxidase, alkaline 
phosphatase or urease, using methods known to those of skill in the art. An appropriate 
enzyme substrate is then used to generate a detectable signal. In other related embodiments,, 
5 competitive-type ELISA techniques can be practiced using methods known to those skilled 
in the art. 

Assays can also be conducted in solution, such that the GapC plasmin-binding 
proteins and antibodies specific for those proteins form complexes under precipitating 
conditions. In one particular embodiment, GapC plasmin-binding proteins can be attached 

10 to a solid phase particle (e.g., an agarose bead or the like) using coupling techniques known 
in the art, such as by direct chemical or indirect coupling. The antigen-coated particle is 
then contacted under suitable binding conditions with a biological sample suspected of 
containing antibodies for the GapC plasmin-binding proteins. Cross-linking between bound 
antibodies causes the formation of particle-antigen-antibody complex aggregates which can 

15 be precipitated and separated from the sample using washing and/or centrifugation. The 
reaction mixture can be analyzed to determine the presence or absence of antibody-antigen 
complexes using any of a number of standard methods, such as those immunodiagnostic 
methods described above. ~ - - 

In yet a further embodiment, an immunoaffinity matrix can be provided, wherein a 

20 polyclonal population of antibodies from a biological ^sample suspected of containing anti- 
GapC plasmin-binding molecules is immobilized to a substrate. In this regard, an initial 
affinity purification of the sample can be carried out using immobilized antigens. The 
resultant sample preparation will thus only contain anti-streptococcus moieties, avoiding 
potential nonspecific binding properties in the affinity support. A number of methods of 

25 immobilizing immunoglobulins (either intact or in specific fragments) at high yield and 
good retention of antigen binding activity are known in the art. Not being limited by any 
particular method, immobilized protein A or protein G can be used to immobilize 
immunoglobulins. 

Accordingly, once the immunoglobulin molecules have been immobilized to provide 
30 an immunoaffinity matrix, labeled GapC plasmin-binding proteins are contacted with the 
bound antibodies under suitable binding conditions. After any non-specifically bound . 
antigen has been washed from the immunoaffinity support, the presence of bound antigen 
can be determined by assaying for label using methods known in the art. 
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Additionally, antibodies raised to the GapC plasmin-binding proteins, rather than the 
GapC plasmin-binding proteins themselves, can be used in the above-described assays in 
order to detect the presence of antibodies to the proteins in a given sample. These assays 
are performed essentially as described above and are well known to those of skill in the art. 
5 The above-described assay reagents, including the GapC plasrmn-binding proteins, 

or antibodies tiiereto, can be provided in lots, with suitable instructions and other necessary 
reagents, in order to conduct immunoassays as described above. The kit can also contain, 
depending on the particular immunoassay used, suitable labels and other packaged reagents 
and materials (i.e. wash buffers and the like). Standard immunoassays, such as those 
10 described above, can be conducted using these kits. 

Deposits of Strains TT seful in Practicing the Tnvention 

A deposit of biologically pure cultures of the following strains was made with the 
American Type Culture Collection, 10801 University Boulevard, Manassas, Virginia, under 

15 the provisions of the Budapest Treaty. The accession number indicated was assigned after 
successful viability testing, and the requisite fees were paid. The designated deposits will 
be maintained for a period of thirty (30) years from the date of deposit, or for five (5) years 
after the last request for the deposit, whichever is longer. Should a culture become 
nonviable or be inadvertently destroyed, or, in the case of plasimd-containing strains, lose 

20 its plasmid, it will be replaced with a viable culture(s) of the same taxonomic description. 

Should there be a discrepancy between me sequence presented in the present 
application and the sequence of the gene of interest in the deposited plasmid due to routine 
sequencing errors, the sequence in the deposited plasmid controls. 

25 s£2n iaI PlaSmid GCne Deposit Date ATCC No. 

m?fr«re pET15bgapC SapC May 31, 2000 PTA-1976 

BL21 DE3 (s. dysgalactiae) 

30 bVT™ PMF521C May 31, 2000 PTA-1975 
BL21 DE3 (s m agalactiae) 

BlSfnm pMF521a *f C May 31, 2000 . PTA-1973 

BL21DE3 (S. : uteris) 
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Kcoli pMF521e gapC May 31, 2000 - PTA-1972- 

BL21 DE3 (S: iniae) 

3. Experimental 

Below are examples of specific embodiments for carrying out the present invention. 
The examples are offered for illustrative purposes only, and are not intended to limit the 
scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
amounts, temperatures, etc.), but some experimental error and deviation should, of course, 
be allowed for. 

Materials and Methods 

Enzymes were purchased from commercial sources, and used according to the 
manufacturers' directions. 

In the isolation of DNA fragments, except where noted, all DNA manipulations 
were done according to standard procedures. See, Sambrook et al., supra. Restriction 
enzymes, T 4 DNA ligase, E. coli 9 DNA polymerase II, Klenow fragment, and other 
biological reagents can be purchased from commercial suppliers and used according to the 
manufacturers' directions. Double stranded DNA fragments were separated on agarose gels. 

Sources for chemical reagents generally include Sigma Chemical Company, St. - 
Louis, MO; Alrich, Milwaukee, WI; Roche Molecular Biochemicals, Indianapolis, IN. 

EXAMPLE 1 

Preparation, Amplification- Sequencing. Expression. Purification and Characterization of 
the S. dvszalactiae GapC Plasmin Binding Protein 

A. Preparation of S. dvszalactiae Chromosomal DNA 

A clinical S. dysgalactiae isolate from a case of bovine mastitis (ATCC Accession 
No. ATCC43078) was obtained from the American Type Culture Collection (10801 
University Boulevard, Manassas, VA 201 10-2209), and was used as a source of DNA. The 
organism was routinely grown on TS A sheep blood agar plates (PML Microbiologicals, 
Mississauga, Ontario) at 37° C for 18 hours, or in Todd-Hewitt broth (Oxoid Ltd., 
Hampshire, England) supplemented with 0.3% yeast extract (THB-YE) at 37° C, 5% CQ 2 . 
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Chromosomal DNA was prepared from S. dysgalactiae grown in 1 00 ml of THB-YE 
supplemented with 20 inM glycine for approximately 6 hours, until an A 600 of 0.8 to 1.0 was 
reached. Cells were harvested and re-suspended in 50 mM EDTA, 50 mM Tris-HCl, 0.5% 
Tween-20® (Sigma, St. Louis, MO) and supplemented with RNase A (200 mg/ml), 
proteinase K (20 mg/ml), lysozyme (100 mg/ml) and mutanolysin (100 mg/ml). (SIGMA, 
St. Louis, MO). Following bacterial lysis for 30 minutes at 37° C with vigorous shaking, 
guanidine hydrochloride and Tween-2®, pH 5.5, were mixed with the lysate to give a final 
concentration of 0.8 M and 5%, respectively. This mixture was incubated at 50° C for 30 
minutes. The chromosomal DNA was then purified using a Qiagen genomic-tip lOOg 
(Qiagen, Santa Clarita, CA) and precipitated using 0.7 volumes of isopropanol. The 
resulting pellet was washed in 70% ethanol and re-suspended in 0.5 ml 10 mM Tris-HCl, 
pH 8.8. 

EL Amplification and Clon ing of the ,<? dvsvalantine eapC Gene 

The gapC gene was amplified by PCR (See Mullis et al., U.S. Patent No. 4,683,195; 
Mutts, U.S. Patent No. 4,683,202;). The forward primer, ^C7, contained an Ndel 
restriction (SEQ ID NO: 1, shown in Table 1) and the reverse primer, gapClr, contained a 
BamM site (SEQ ID NO:2, shown in Table 1). In the preceding primer sequences, depicted 
in Table 1, underlining denotes nucleotides added to the original sequence, and holding 
20 indicates the location of restriction endonuclease recognition sites. 

PCR was carried out using Vent DNA polymerase (New England Biolabs, 
Mississauga, ON, Canada). 0,7 ug ofS. dysgalactiae chromosomal DNA was incubated in 
a reaction mixture containing 1 uM of each of the preceding primers, 200 uM each of dATP, 
dTTP, dCTP and dGTP, 3mM MgS0 4 , lx concentration of Thermopol buffer (New 
England Biolabs, Mississauga, ON, Canada) and 2 units Vent DNA polymerase. This 
mixture was incubated for 3 amplification cycles of 1 minute at 94°C, 3 minutes at 50°C 
and lminute, 10 seconds at 72°C, then for 27 amplification cycles at 15 seconds at 95°C, 
30 seconds at 55°C, and 1 minute at 72°C, and finally for 1 cycle of 5 minutes at 72°C. 
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TABLE1 

Sequence Identification Numbers 
and 

Corresponding Nucleotide and Amino Acid Sequences 



SEQ m NO, 


Name 


Sequence 


1 


Primer gapCl 


5 1 - GG CGG CGG CAT ATG GTA GTT A A A GTT 
GGTATTAACGG-3' 


2 


Primer gapClr 


5*-GC GGA TCC TTA TTT AGC GAT TTT TGC 
AAA GTA CTC -3* 


3 


Streptococcus dysgalactiae 
gapC gene 


(see Figure 1) 


4 


Streptococcus dysgalactiae 
GapC protein 


5 


Stre&tococcus cL&alcLrtifip 
gapC gene 


(see Figure 2) 


6 


Streptococcus agalactiae 
GapC protein 


7 


L>ir&piL/l*Ut,iJua lltJiZTlS 

gapC gene 


(see Figure 3) 


8 


Streptococcus uteris 
GapC protein 


9 


Streptococcus parauberis 
gapC gene 


(see Figure 4) 


10 


Streptococcus parauberis 
GapC protein 


11 


Streptococcus iniae 
gapCgene 


(see Figure 5) 


, 12 


Streptococcus iniae 
GapC protein 
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The gapC PCR product was cloned into the expression vector pETl 5B (Novagen, 
Madison, WI) which had been digested with BamHI and Ndel. Cloning of the PCR product 
into this site results in the addition of an in-frame coding sequence for a hexahistidyl tag to 
5 the gapC coding sequence. Subsequent expression yields a full-length protein with an 
attached histidine tag, which permits purification of the protein under non-denaturing 
conditions using metal chelate chromatography. 

This construct was used to transform E. coli BL21 DE3 (Life Technologies, 
Gaithersburg, MD). This transformed strain was designated BL21 DE3 (pET15bgapC) 
10 (ATCC No: ). 

°-* Isolation of Chromosomal DNA and Am p lification and Cloning of the gflP (7fien R 
from S. agalactiae S. uberis. S. p arauberis and S. iniae 

1 5 The gapC gene were prepared from other isolates essentially as described above. 

Chromosomal DNA from S. agalactiae, S. uberis, and S. parauberis was isolated 
from strains obtained from the American Type Culture Collection (10801 University 
Boulevard, Manassas, VA 201 10-2209; strains designated ATCC 27541, 9927, and 13386, 
respectively). Chromosomal DNA from S. iniae was isolated from a strain designated 9117 
20 obtamed from Mount Smai Hospital, University of Toronto. 

The primers used to amplify the gapC genes from the Streptococcus strains listed 
above were the same as those used in the case of S. dysgalactiae, i.e., primer gapCl (SEQ 
ID NO: 1) and primer gapClr (SEQ ID NO:2). 

After amplification, the PCR product in each case was cloned into pPCR-Script, 
25 using the cloning protocol described in the PCR-Script Amp cloning Kit (Stratagene, La 

Jolla, CA). The PCR product insert was then excised using Ndel and BamHI md re-cloned 
into those sites in pE15b using conventional cloning protocols (See e.g., Sambrook et al., 
supra.). The plasmids containing the S. agalactiae, S. uberis, S. parauberis and S. iniae 
were designated pMF521c (ATCC No:), pMF521a (ATCC No:), pMF521d (ATCC No:), 
30 and pMF521e (ATCC No:), respectively. 
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IL Nucleotide Sequence of the gapC Gene and Deduced Amino Acid Sequences 

Sequences homologous to gapC of & equisimilis homolog (Gase, et al. (1996) 
European J, of Biochem. 239 :42-51*) were originally identified while sequencing a linked 
but unrelated gene of S. dysgalactiae. To obtain the complete sequence of the S. 
5 dysgalactiae gapC gene, PCR was employed using the primers described above, i.e., primer 
gapCl and primer gapClr. 

The sequence was determined using fluorescence tag terminators on an ABI 373 
DNA automatic sequencer (Applied Biosystems, Emeryville, CA) at the Plant 
Biotechnology Institute (PBI, Saskatoon, Canada). 
10 both 1 depicts the coding sequence of the gapC gene from S. dysgalactiae 

(DysGapC) (SEQ ID NO:3) and the deduced amino acid sequence (SEQ ID NO:4). 

The sequences of the GapC proteins isolated form S. agalactiae, S. uberis, S. 
parauberis, and S. iniae were determined by the same method. 

Figures 2 through 5 depict both the nucleotide sequences and the predicted amino 
15 acid sequences for the S. dysgalactiae GapC protein (DysgalGapC) (SEQ ID NQ:3 and SEQ 
ID NO:4), as well as for the GapC proteins of S. agalactiae (AgalGapC) (SEQ ID NO:5 and 
SEQ ID NO:6), S. uberis (UberGapC) (SEQ ID NO:7 and SEQ ID NO:8), S. parauberis 
(PUberGapC) (SEQ ID NO:9 and SEQ ID NO:10), and S. iniae (IniaeGapC) (SEQ ID 
NO: 1 1 and SEQ ED NO: 12), respectively. 
20 The S. dysgalactiae GapC protein gene depicted in Figures 1A-1B codes for a 336 

amino acid protein which does not appear to contain either a signal sequence or membrane 
anchor domain. A search oftheGenBank database using the BLASTX program revealed 
that the open reading frame was 95.5% homologous to GapC of S. equisimilis (GenBank 
Accession No. X97788) and 99.4% homologous to GapC of 5. pyogenes (GenBank 
25 Accession No. M95569). The predicted amino acid sequence of the GapC protein also 
exhibited 43% amino acid identity to bovine glyceraldehyde-3-phosphate dehydrogenase 
(GenBank Accession No. U85042). 

Similarly, for the S. agalactiae, S. uberis, S. parauberis and S. iniae GapC protein 
sequences, neither signal sequences nor membrane anchor domains appear to be present. 
30 Sequence homologies are tabulated in Table 2. 
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TABLE 2 

Sequence Homologies Between Various GapC Protein Sequences 





S. equisimilis 


S. pyogenes 


Bovine GAPDH 


S. dysgalactiae 


95.5% 


94.4% 


43% 


S. agalactiae 


87.02% 


91.07% 


(not determined) 


S. paraufoeris 


86.31% 


90.77% 


S. uberis 


88.39% 


92.26% 


S. iniae 


86.31% 


89.88% 



10 



15 



20 



25 



30 



^ Expression and Purification of the Rennmhi n ant S. dv^nlhc tiae GanC Plasma 
Binding Protein 

The Hexahistidyl-tagged GapC protein was expressed and purified under non- 
denaturing conditions using metal chelate (Ni-NTA) affinity chromatography. 

E. coli BL21 DE3 containing the recombinant plasmid was grown in Luria Broth, 
containing 100 ug/ml ampicillin to an A,,,,, of approximately 0.5. Expression of the GapC 
protein was then induced by the addition of 1 mM isopropyl-p ,D-thiogalactoside (IPTG) 
(Sigma, St. Louis, MO]. Following three hours incubation at 37°C, cells were harvested, 
washed in column buffer (50 mM sodium phosphate buffer, pH8.0, 0.3 M NaCl, 10 mM 
imidazole) and lysed by sonication. 

Approximately 40% of the recombinant protein was in tlie soluble fraction of the 
cell sonicate with a yield of approximately 50 mg of the recombinant protein per litre of 
culture volume, determined with a DC Protein Assay Kit (Bio-Rad Laboratories, 
Mississauga, ON, Canada) using bovine serum albumin (Pierpe, Rockford, IL) as a 
standard. 

The lysate was cleared by centrifugation and the soluble fraction was applied to a 
Ni-NTA column (Qiagen), which was subsequently washed with 10 column volumes of 
column buffer (as above, except containing 20 mM imidazole). The Hexahistidyl-tagged 
GapC was eluted using column buffer (as above, except containing 250 mM imidazole), 
yielding a homogenous protein fraction having a GapC concentration of 10-15 mg/ml. That 
fraction was dialyzed against 2000 volumes of PBS A (136mM sodium chloride, 2.6mM 
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potassium chloride/ 8. ImM sodium phosphate dibasic, 1.46 mM potassium phosphate 
monobasic). 

Ei Expression and Purification of Recombinant GapC Protein from S. azalactiae^ S. 
uberis. S. varauberis* and£ iniae 

Expression and purification of the recombinant proteins from these streptococcus 
species is accomplished by the same methods described in Example IE, above. The 
transformed bacterial strains used to express the S. agalactiae, £ uberis, S. parauberis and 
S. iniae recombinant GapC proteins were designated BL21 DE3 (pMF521c) (ATCC No:), 
BL21 DE3 (pMF521a) (ATCC No:), BL21 DE3 (pMF521d) (ATCC No:), and BL21 
DE3 (pMF521e) (ATCCNo:),.respectively. 

GL Characterization of the Recombinant S. dysgalactiae GapC Protein 
1. SDS-Paee Analysis 

SDS-polyacrylamide gel electrophoresis was performed on a sample of the eluted 
protein using the method described by Laemli (Laemli, U.K. (1970) Nature 227:680-685). 
The results are presented in Figure 18. In the figure: lane 1, molecular weight markers (20.5 
to 103 kDa range; BioRad Laboratories, Emeryville, CA); lane 2, soluble recombinant S. 
dysgalactiae GapC protein purified by Ni-NTA affinity chromatography. 

These results demonstrate that purification by affinity chromatography on a Ni-NTA 
column yielded a homogenous protein fraction. 

2^ Glvcerald ehvde-3-phosphate dehydrogenase f GAPDID activity of 
recombinant GapC and S. dysgalactiae whole cells 

GAPDH catalyzes the oxidative phosphorylation of D-glyceraldehyde-3-phosphate 
to 1 , 3-diphosphoglycerate in the presence of NAD + and inorganic phosphate. The high 
degree of homology of GapC to streptococcal glyceraldehyde-3 -phosphate dehydrogenase 
suggested that GapC may exhibit this enzyme activity. 

The GAPDH activity of S. dysgalactiae whole cells (10 10 CFU) and the recombinant 
GapC protein was determined by measuring the reduction of NAD + to NADH. The assay 
buffer was composed of 40 mM triethanolamihe, 50 mM Na2 HP0 4 and 5 mM EDTA, pH 
6.8. S. dysgalactiae cells or 5 mg of purified recombinant protein were incubated in assay 
buffer containing 7 ml glyceraldehyde-3-phosphate (49 mg/ml; Sigma Chemical Company), 
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75 microliters NAD + (15 mM; Sigma Chemical Company) in a final volume of 1 ml. 
Negative controls consisted of samples which did not contain glyeraldehyde-3 -phosphate or 
the recombinant GapC molecule/A dysgalactiae cells. The reduction of NAD + to NADH 
was monitored spectrophotometrically at an Absorbance of 340 nanometers. 

The results indicated that both the recombinant protein as well as intact wild-type S. 
dysgalactiea cells had en2ymatic activity (not shown). Furthermore, when S. dysgalactiae 
cells were treated with Trypsin to digest surface proteins, GAPDH activity disappeared. 
Thus, the enzymatic activity observed for the intact wild type cells was not due to 
intracellular GAPDH. 

This data suggests that the GapC protein is localized on the cell surface despite the 
apparent lack of either a signal sequence or a membrane anchoring region in either the 
nucleotide or amino acid. 

3. Plasmin-Binding Activity of Recombinant S. dysgalactiae GapC Plasmin 
Binding Protein and S. dysgalactiae whole cells. 

A microplate assay was used to determine if the recombinant GapC protein was 
capable of binding bovine plasmin, and if so whether the bound plasmin was in an 
enzymatically active form. 

Ninety-six-well microtiter plates were coated with 5 mg of purified recombinant 
GapC protein, washed 3 times with 0.1% gelatin-PBSA with 0.05% TWEEN-20 (PBSGT). 
The wells were blocked for one hour at 37° C in the same buffer, washed, and incubated 
with 200 ml of bovine plasmin (0.25mg/ml; Boehringer Mannheim, Indianapolis, IN) for 1 
hr at 37° C. The wells were then washed 8 times with PBSGT. 200 ml of the synthetic 
substrate chromazine-PL (Tos-Gly-Pro-Lys-4-NA, 0.3 mg/ml) were added to the wells and 
incubated at 37° C for one hour. The presence of associated plasmin activity was 
determined by measuring the level of paranitroanalide (4-nitranihne) released into the 
supernatant and detected based on an Absorbance of 405 nanometers. A similar procedure 
was used to measure plasmin-binding activity of S. dysgalactiae whole cells, with the 
exception that 10 10 cells were washed with PBSGT and re-suspended in 400 \xl chromazine- 
PL (0.3mg/ml) and incubated for 1 hour at 37°C. 

The results, shown in Figure 19, demonstrate that the purified recombinant protein 
was capable of binding enymatically active bovine plasmin. Likewise, when S. 
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dysgalactiae whole cells were utilized, similar results were obtained. In the figure, the data 
represents the mean of three individual assays. 

Thus, the plasmin-receptbr is located on the surface of S. dysgalactiae and the 
purified protein retains biological activity. 

EXAMPLE 2 

Immuniz ation with S. dysgalactiae GapC and expe rimental in fection of cattle 
Vaccines were formulated in such a fashion that they contained 50 mg/ml of affinity 
purified recombinant GapC in the oil-based adjuvant VSA3 (VIDO, Saskatoon, 
Saskatchewan, Canada). VSA3 is a combination of Emulsigen Plus™ (MVP Laboratories, 
Ralston, Nebraska) and Dimethyldioctadecyl ammonium bromide (Kodak, Rochester, New 
York). The affinity-purified recombinant GapC protein used for the vaccine preparation is 
shown in Figure 18. 

Twenty-four non-lactating Holsteins with no history of S. dysgalactiae infection 
were obtained from various farms in Saskatchewan, Canada. One week prior to 
vaccination, all animals were treated with Cepha-dry™ (Ayerst Laboratories, Montreal, 
Canada) (300 mg per quarter), in order to clear any infection of the udders prior to the 
vaccination step. 

Groups of 8 animals were immunized subcutaneously with two doses of vaccines c 
containing S. dysgalactiae GapC, Mig (an Fc receptor protein isolated from S. dysgalactiae 
which was evaluated simultaneously), or a placebo with a three-week interval between 
i mmunizations. Two weeks following the second immunization, animals were exposed to 
650 colony forming units of S. dysgalactiae delivered into three quarters with an udder 
infusion cannula. The fourth quarter on each animal served as an un-infective control. 

All animals were examined daily for clinical signs of disease and samples from all 
udder quarters were collected on each day. Samples were observed for consistency and 
somatic cell counts as well as bacterial numbers were determined. 

EXAMPLE 3 
Determination of GapC-specific Antibodies 
GapC-specific antibodies in bovine serum were measured using an enzyme-linked 
immunosorbent assay (ELISA). Briefly, microtiter plates (NUNC, Naperville, Illinois) 
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Ayere coated by adding 1 microgram per well purified recombinant antigen in 50mM sodium 
carbonate buffer, pH 9.6, and incubated overnight at 4° C. The liquid was removed and the 
wells were blocked with 3% bovine serum albumin for 1 hr at 37° C. Serial dilutions of 
bovine serum (from 1 in 4 to 1 in 6,400) were then added to the wells and incubated for 2 
5 hours at room temperature. The wells were aspirated, washed and incubated with 1 00 ml of 
alkaline phosphatase-conjugated goat anti-bovine IgG (Kirkgaard & Perry Laboratories 
Inc.,Gaithersburg, MD) for 1 hr at room temperature. The wells were washed again, and 
1 00 ul of p-nitrophenol phosphate (SIGMA, St. Louis, MO) was added as a substrate to 
detect alkaline phosphatase activity. The absorbance at 405 nanometers was recorded 
10 folio wing 1 hr incubation with the substrate at room temperature. 

Where referred to in the figures, the specific antibody titer is expressed as the 
reciprocal of the dilution showing activity above background levels. 

EXAMPLF. 4 

^ Bacterial Colonization 

Bacteria were enumerated by spreading serial dilutions (1 0° to 10 3 ) directly onto 
TSA sheep blood agar plates followed by overnight incubation at 37°C, 5% C0 2 . 
Colonization is defined as >500 cfu/ml of the challenge organism recovered. 

To confirm that the bacteria recovered from milk secretions were S. dysgalactiae, 
selected colonies recovered from each animal were tested using an API strep-20 test 
(bioMerieux SA, Hazelwood, Missouri) according to the manufacturer's instructions. This 
test is a standardized method combining 20 biochemical tests for the determination of 
enzymatic activity and fermentation of sugars. The reactions are read according to a reading 
table and the identification is obtained by either referring to the analytical profile index or 
25 using identification software. 

Following challenge, animals from all groups were shown to be colonized by S. 
dysgalactiae (Figure 20). Only the GapC-immunized cows had a statistically significant 
reduction in the number of infected quarters and total numbers of bacteria isolated per 
quarter. Therefore, immunization with GapC reduced bacterial colonization following 
30 challenge with S. dysgalactiae. 

The relationship between anti-GapC titer and bacterial colonization is shown in 
Figures 21 and 22. There was a strong correlation between anti-GapC serum antibody level 
and the maximum number of bacteria (expressed in CFU (log I0 )/ml niilk) found in any 
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quarter r « 0.74) (Figure 21) as well as the total number of infected quarters (r 2 = 0.74) 
(Figure 22). Correlation was calculated using GraphPad Prism software, version 2.01 
(GraphPad Software Inc., San Diego, California). 

This correlation is also illustrated in Figure 23 and 24 where the GapC-immunized 
group is subdivided into high titer and low titer responders. In these figures, "low titer 
responders" refer to the four animals with the poorest response against GapC while "high 
titer responders" refer to the remainder of the group. No colonization-occurred in the high 
titer group, while even the low titer group showed reduced numbers of bacteria recovered 
after day 3. 

EXAMPLE 5 
Determination of inflammatory response 

Inflammatory response was measured as a function of somatic cell count (i.e., 
lymphocytes, neutrophils, and monocytes). Somatic cell counts were measured in a Coulter 
counter using standard techniques, as recommended by Agriculture and Agri-Food Canada 
Pamphlet IDF50B (1985) Milk and Milk products-Methods of Sampling, Samples were 
always read within 48 hours of collection and fixation, at days 1 through 7 post challenge. 

The numbers of somatic cells present in the gland was determined on each day post 
challenge. Numbers from the unchallenged quarter remained constant throughout the trial 
while on day 1, the GapC group was lower than the placebo-immunized group (Figure 25). 
The difference between the GapC and the placebo groups was statistically significant. The 
individual data from day 1 is shown in Figure 26; data for GapC-treated animals over a 7 
day period post-challenge is shown in Figure 27. Samples from the quarters of GapC- 
immunized animals were indistinguishable from unchallenged quarters. 

Therefore, immunization with GapC reduced the inflammatory response following 
challenge with S. dysgalactiae. 

EXAMPLE 6 
Cross-protection against S. uberis infection 
During the last 3 days of the vaccine trial, a mixed population of bacteria were 
recovered from the mammary gland secretions. Further analysis with the API 20 Strep test 
confirmed the identity of the strains present in the mixture as S. dysgalactiae and S. uteris, 
the latter representing a natural infection which occurred during the trial. The group which 
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was immunized with GapC appeared to be significantly cross-protected against S. uberis 
colonization (see Figure 20), indicating that GapC may be a broadly cross-protective 
antigen capable of protecting against infection by multiple Streptococcal species. 

Thus, vaccination with S. dysgalactiae GapC is therefore capable of providing cross- 
5 protection against S. uberis infection. 

************************* 

Thus, the cloning, expression and characterization of various GapC plasmin binding 
proteins is disclosed, as are methods of using the same. Although preferred embodiments of 
10 the subject invention have been described in some detail, it is understood that obvious 

variations can be made without departing from the spirit and the scope of the invention as 
defined by the appended claims. 
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What is claimed is : 

1 . An isolated GapC protein selected from the group consisting of: 

(a) an isolated Streptococcus dysgalactiae GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figure 1 A-1B (SEQ ID 
NO:4); 

(b) an isolated Streptococcus agalactiae GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 2A-2B (SEQ ID 
NO:6); 

(c) an isolated Streptococcus uberis GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 3A-3B (SEQ ID 
NO:8); 

(d) an isolated Streptococcus parauberis GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 4A-4B (SEQ ID 
NO:10); . 

(e) an isolated Streptococcus iniae GapC protein comprising the amino acid 
sequence shown at amino acid positions 1 to 336, inclusive, of Figures 5A-5B (SEQ ID 
NO: 12); and 

(f) immunogenic fragments of (a), (b), (c), (d) and (e) comprising at least about 5 
amino acids. 

2. The isolated GapC protein of claim 1, wherein the protein is a Streptococcus 
dysgalactiae GapC protein comprising the amino acid sequence shown at amino acid 
positions 1 to 336, inclusive, of Figures 1A-1B (SEQ ID NO:4). 

3 . The isolated GapC protein of claim 1 , wherein the protein is a Streptococcus 
agalactiae GapC protein comprising the amino acid sequence shown at amino acid 
positions 1 to 336, inclusive, of Figures 2A-2B (SEQ ID NO:6). 

4. The isolated GapC protein of claim 1 , wherein the protein is a Streptococcus 
uberis GapC protein comprising the amino acid sequence shown at amino acid positions 1 
to 336, inclusive, of Figures 3A-3B (SEQ ID NO:8). 
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5. The isolated GapC protein of claim 1, wherein the protein is a Streptococcus 
parauberis GapC protein comprising the amino acid sequence shown at amino acid 
positions 1 to 336, inclusive, of Figures 4A-4B (SEQ ID NO: 10). 

6. The isolated GapC protein of claim 1, wherein the protein is a Streptococcus 
iniae GapC protein comprising the amino acid sequence shown at amino acid positions 1 to 
336, inclusive, of Figures 5A-5B (SEQ ID NO: 12). 

7. An isolated polynucleotide comprising a coding sequence for an isolated GapC 
protein according to any of claims 1-6, or complements thereof. 

8. A recombinant vector comprising: 

. (a) the isolated polynucleotide of claim 7; and 

(b) at least one heterologous control element operably linked to said isolated 
polynucleotide, whereby said coding sequence can be transcribed and translated in a host 
cell, and at least one of said control elements is heterologous to said coding sequence. 

9. A host cell comprising the recombinant vector of claim 8 . 

10. A method for producing a GapC protein, said method comprising culturing the 
cells of claim 9 under conditions for producing said protein. 

1 1 . A vaccine composition comprising a pharmaceutical^ acceptable vehicle and a 
GapC protein, wherein said GapC protein is selected from the group consisting of: 

(a) a Streptococcus dysgalactiae GapC protein comprising the amino acid sequence 
shown at amino acid positions 1 to 336, inclusive, of Figures 1A-1B (SEQ ID NO:4); 

(b) a Streptococcus agalactiae GapC protein comprising the amino acid sequence 
shown at amino acid positions 1 to 336, inclusive, of Figures 2A-2B (SEQ ID NO:6); 

(c) & Streptococcus uteris GapC protein comprising the amino acid sequence shown 
at amino acid positions 1 to 336, inclusive, of Figures 3A-3B (SEQ ID NO:8); 

(d) a Streptococcus parauberis GapC protein comprising the amino acid sequence 
shown at amino acid positions 1 to 336, inclusive, of Figures 4A-4B (SEQ ID NO: 10); 
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(e) a Streptococcus iniae GapC protein comprising the amino acid sequence shown 
at amino acid positions 1 to 336, inclusive, of Figures 5A-5B (SEQ ID NO: 12); 

(f) a Streptococcus GapC protein having at least about 70% sequence identity to (a), 
(b), (c), (d) and (e); and 

5 (g) immunogenic fragments of (a), (b), (c), (d), (e) and (f), said fragments 

comprising at least about 5 amino acids. 

12. The vaccine composition of claim 11, wherein said GapC protein comprises the 
amino acid sequence of the Streptococcus dysgalactiae GapC protein depicted at amino 

10 acid positions 1 to 336, inclusive, of Figures 1 A-1B (SEQ ID NO:4) or an immunogenic 
fragment thereof, said 'fragment comprising at least about 5 amino acids. 

13. The vaccine composition of claim 12, wherein said GapC protein comprises the 
amino acid sequence depicted at amino acid position 1 to 336, inclusive, of Figures 1 A- IB 

15 (SEQE>NO:4). 

14. The vaccine composition of claim 11, wherein said GapC protein comprises the 
amino acid sequence of the Streptococcus agalactiae GapC protein depicted at amino acid 
positions 1 to 336, inclusive, of Figures 2A-2B (SEQ ED NO:6) or an immunogenic 

20 fragment thereof, said fragment comprising at least about 5 amino acids. 

15. The vaccine composition of claim 14, wherein said GapC protein comprises the 
amino acid sequence depicted at amino acid positions 1 to 336, inclusive, of Figures 2A-2B 
(SEQIDNO:6). 



25 



30 



16. The vaccine composition of claim 1 1, wherein said GapC protein comprises the 
amino acid sequence of the Streptococcus uberis GapC protein depicted at amino acid 
positions 1 to 336, inclusive, of Figures 3A-3B (SEQ ID NO:8) or an immunogenic 
fragment thereof, said fragment comprising at least about 5 amino acids. 

17. The vaccine composition of claim 16, wherein said GapC protein comprises the 
amino acid sequence depicted at amino acid positions 1 to 336, inclusive, of Figures 3A-3B 
(SEQ ID NO: 8). 



JSDOCID: <WO 0106381A2_L> 



SUBSTITUTE SHEET (RULE 26) 



WO 01/96381 4 " PCT/CA01/00838 

-53- . 



18. The vaccine composition of claim 1 1 , wherein said GapC protein comprises the 
amino acid sequence of the Streptococcus parauberis GapC protein depicted at amino acid 
positions 1 to 336, inclusive, of Figures 4A-4B (SEQ ED NO: 10) or an immunogenic 
fragment thereof, said fragment comprising at least about 5 amino acids. 

5 \ 

19. The vaccine composition of claim 1 8, wherein said GapC protein comprises the 
amino acid sequence depicted at amino acid positions 1 to 336, inclusive, of Figures 4A-4B 
(SEQ ID NO: 10). 

10 20. The vaccine composition of claim 11, wherein said GapC protein comprises the 

amino acid sequence of the Streptococcus iniae GapC protein depicted at amino acid 
positions 1 to 336, inclusive, of Figures 5A-5B (SEQ ID NO: 12) or an immunogenic 
fragment thereof, said fragment comprising at least about 5 amino acids. 

15 21 . The vaccine composition of claim 20, wherein said GapC protein comprises the 

amino acid sequence depicted at amino acid positions 1 to 336, inclusive, of Figures 5A-5B 
(SEQ ID NO: 12). 

22. The vaccine composition according to any of claims 11-21, further comprising 
20 an adjuvant. 

23. A method of producing a vaccine composition comprising the steps of 

(1) providing a GapC protein or an immunogenic fragment thereof, said fragment 
comprising at least about 5 amino acids, and 
25 (2) combining said protein with a pharmaceutically acceptable vehicle. 

24. The method of claim 23, wherein the GapC protein is the protein of any of 
claims 1-6. 

30 25. Antibodies directed against the isolated GapC protein of any of claims 1-6. 

26. The antibodies of claim 25 wherein said antibodies are polyclonal. 
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27. The antibodies of claim 25 wherein said antibodies are monoclonal. 

28. Use of a GapC protein according to any of claims 1-6, in the manufacture of a 
vaccine composition useful for treating or preventing a bacterial infection in a vertebrate 

5 subject. 

29. The use of claim 28, wherein said bacterial infection is a streptococcus 
infection. 

10 30. The use of claim 29, wherein said bacterial infection causes mastitis. 

31 . Use of a polynucleotide according to claim 7 in the manufacture of a vaccine 
composition for treating or preventing a bacterial infection in a vertebrate subject. 

The use of claim 31, wherein said bacterial infection is a streptococcal infection. 

The use of claim 32, wherein said bacterial infection causes mastitis. 

34. Use of a GapC protein in the manufacture of a diagnostic reagent for detecting 
20 Streptococcus antibodies in a biological sample. 

35. Use of a GapC protein according to any of claims 1-6 in the manufacture of a 
diagnostic reagent for detecting Sfreptococcus antibodies in a biological sample. 

25 36. Use of antibodies according to any of claims 25-27 in the manufacture of a 

diagnostic reagent for detecting a GapC protein in a biological sample. 

37. An immunodiagnostic test kit for detecting Streptococcus infection, said test kit 
comprising a GapC protein and instructions for conducting the immunodiagnostic test. 

30 

38. An immunodiagnostic test kit for detecting Streptococcus infection, said test kit 
comprising antibodies directed against a GapC protein and instructions for conducting the 
immunodiagnostic test. 



15 32. 

33. 
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39. A method of treating or preventing a bacterial infection in a vertebrate subject 
comprising administering to said subject a therapeutically effective amount of a vaccine 
composition according to any of claims 11-22. 

40. The method of claim 39, wherein said bacterial infection is a streptococcus 
infection. 

41. The method of claim 40, wherein said bacterial infection causes mastitis. 

42. A method of treating or preventing a bacterial infection in a vertebrate subject 
comprising administering to said subject a therapeutically effective amount of a 
polynucleotide according to claim 7. 



43. The method of claim 42, wherein said bacterial infection is a streptococcal 
15 infection. ' 



44. The method of claim 43, wherein said bacterial infection causes mastitis. 

45. A method of detecting Streptococcus antibodies in a biological sample, 
20 comprising: 

(a) reacting said biological sample with an isolated GapC protein under conditions 
which allow said Streptococcus antibodies, when present in the biological sample, to bind to 
said GapC protein to form an antibody/antigen complex; and 

(b) detecting me presence or absence of said complex, thereby detecting the presence 
25 or absence of Streptococcus antibodies in said sample. 

46. A method of detecting Streptococcus antibodies in a biological sample, 
comprising: 

(a) reacting said biological sample with an isolated GapC protein according to any of 
30 claims 1-6, under conditions which allow said Streptococcus antibodies, when present in the 

biological sample, to bind to said GapC protein to form an antibody/antigen complex; and 

(b) detecting the presence or absence of said complex, thereby detecting the presence 
or absence of Streptococcus antibodies in said sample. 
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47. A method of detecting a GapC protein in a biological sample, comprising: 

(a) reacting said sample with antibodies directed against the GapC protein under 
conditions which allow said antibodies to bind to said GapC protein, when present in the 
sample, to form an antibody/antigen complex; and 

(b) detecting the presence or absence of said complex, thereby detecting the presence 
or absence of a GapC protein in said sample. 



SUBSTITUTE SHEET (RULE 26) 



WO 01/96381 



PCT/CA01/00838 



1/34 



atg gta gtt aaa gtt ggt att aac ggt ttc ggt cgt ate gga cgt ctt 48 
Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 15 

gca ttc cgt cgt att caa aat gtt gaa ggt gtt gaa gta act cgt ate 96 
Ala Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg lie 
20 25 30 

aac gac ctt aca gat cca aac atg ctt gca cac ttg ttg aaa tac gat 144 
Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa gga cgt ttt gac gga act gtt gaa gtt aaa gaa ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly 
50 55 60 

ttt gaa gta aac gga aac ttc ate aaa gtt tct get gaa cgt gat cca 240 
Phe Glu Val Asn Gly Asn Phe lie Lys Val Ser Ala Glu Arg Asp Pro 
65 70 75 80 

gaa aac ate gac tgg gca act gac ggt gtt gaa ate gtt ctg gaa gca 288 
Glu Asn lie Asp Trp Ala Thr Asp Gly Val Glu lie Val Leu Glu Ala 

85 90 95 

act ggt ttc ttt get aaa aaa gaa get get gaa aaa cac tta cat get 336 
Thr Gly Phe Phe Ala Lys Lys Glu Ala Ala Glu Lys His Leu His Ala 
100 105 110 

aac ggt get aaa aaa gtt gtt ate aca get cct ggt gga aac gac gtt 3 84 
Asn Gly Ala Lys Lys Val Val lie Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

aaa aca gtt gtt ttc aac act aac cac gac att ctt gac ggt act gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

aca gtt ate tea ggt get tea tgt act aca aac tgt tta get cct atg 4 80 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 ISO 155 160 

get aaa get ctt cac gat gca ttt ggt ate caa aaa ggt ctt atg act 52 8 
Ala Lys Ala Leu His Asp Ala Phe Gly He Gin Lys Gly Leu Met Thr 
165 170 175 

aca ate cac get tat act ggt gac caa atg ate ctt gac gga cca cac 576 
Thr lie His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gac ctt cgt cgt get cgt get ggt get gca aac att gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val 
195 200 205 



FIG. 1A 
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tea caa ttg gtt aaa gtt gta tea tgg tat gac aat gaa atg tct tac 
Ser Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 

act get caa ctt gtt cgt aca ctt gag tac ttt gca aaa ate get aaa 
Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
3 25 330 335 

taa 



FIG. 1B 
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720 



cct aac tea act ggt get get aaa get ate ggt ctt gtt ate cca gaa 

Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val lie Pro Glu 
210 215 220 

ttg aat ggt aaa ctt gat ggt get gca caa cgt gtt cct gtt cca act 

Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 2 40 

gga tea gta act gag ttg gtt gta act ctt gat aaa aac gtt tct gtt 768 

Gly Ser Val Thr Glu Leu Val Val Thr Leu Asp Lys Asn Val Ser Val 

245 250 255 

gac gaa ate aac get get atg aaa get get tea aac gac agt ttc ggt 816 

Asp Glu He Asn Ala Ala Met Lys Ala Ala Ser Asn Asp Ser Phe Gly 
260 265 270 

tac act gaa gat cca att gtt tct tea gat ate gta ggc gtg tea tac 864 

Tyr Thr Glu Asp Pro He Val Ser Ser Asp He Val Gly Val Ser Tyr 
275 280 285 

ggt tea ttg ttt gac gca act caa act aaa gtt atg gaa gtt gac gga 912 

Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Met Glu Val Asp Gly 
290 295 300 
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atg gta gtt aaa gtt ggt att aac ggt ttc ggt cgt ate ggt cgt ctt 
Met Val Val Lys Val Gly lie Aen Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 15 

gca ttc cgt cgc ate caa aac gta gaa ggt gtt gaa gtt act cgt ate 
Ala Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg He 



20 25 



30 



48 



96 



aac gac ctt aca gat cca aac atg ctt gca cac ttg ttg aaa tat gac 144 
Asia. Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa ggt cgt ttc gac ggt act gtt gaa gtt aaa gaa ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly 
50 55 60 

ttc gaa gtt aac ggt caa ttt gtt aaa gtt tct get gaa cgc gaa cca 240 
Phe Glu Val Asn Gly Gin Phe Val Lys Val Ser Ala Glu Arg Glu Pro 
65 70 75 80 

gca aac att gac tgg get act gat ggc gta gaa ate gtt ctt gaa gca 2 88 
Ala Asn He Asp Trp Ala Thr Asp Gly Val Glu lie Val Leu Glu Ala 

85 90 95 

act ggt ttc ttt gca tea aaa gaa aaa get gga caa cac ate cat gaa 336 
Thr Gly Phe Phe Ala Ser Lys Glu Lys Ala Gly Gin His He His Glu 
10° 105 no 

aat ggt get aaa aaa gtt gtt ate aca get cct ggt gga aac gac gtt 3 84 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val 
US 120 125 

aaa aca gtt gtt ttc aac act aac cac gat ate ctt gat gga act gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
"0 135 140 

aca gtt ate tea ggt get tea tgt act aca aac tgt ctt get cca atg 480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 .-ISO 155 160 

get aaa get tta caa gac aac ttt ggt gtt aaa caa ggt ttg atg act 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 

170 175 

act ate cac gca tac act ggt gac caa atg ate ctt gac gga cca cac 576 
Thr He His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro Hie 
180 185 190 

cgt ggt ggt gac ctt cgt cgt get cgt gca ggt get gca aac ate gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val 
155 200 205 

FIG. 2A 
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t*v»~ o ^ ■ " s www yea aaa auc get aaa 

Thr Ser Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ila Lys 

taa 



325 330 335 



720 



cct aac tea act ggt get gca aaa get ate gga ctt gtt ate cca aaa 67, 
Pro Asn Ser Thr Gly Ala Ala Lys Ala He Sy Leu Si lie PrJ 2lu 
210 215 

1 2 a-* I?*" f Sa f tfc 9at 3gt gCt 3Ca caa C 9 C 9 fct cct 9tt cca act 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Sr 

230 235 2 4o 

gga tea gta act gaa ttg gtt gca act ctt gaa aaa gac gta act otc 7«« 
Gly Ser Val Thr Glu Leu Val Ala Thr Leu Slu Lys Sp vS Sr lit 
245 250 255 

1^ I? 3 vti f ^ f? fc atS aM gCa SCa gct aac 9 at tc * tac ggt 816 

Glu Glu Val Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 

260 265 270 

tat act gaa gat cca ate gta tea tct gat ate gtt ggt att tea tac a 64 
Tyr Thr Glu Asp Pro He Val Ser Ser Lp He Val 3J Tit Ser" tJt 
275 280 2B5 . * 

ggt tea ttg ttt gat get act caa act aaa gtt caa act gtt gac oat 912 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val £p gy 
290 295 3 oo 

aac caa ttg gtt aaa gtt gtt tea tgg tac gat aac gaa atg tea tac 960 
Asn Gin Leu Val Lys Val Val Ser Trp Tyr Lp Asn Ilu Mel SeJ Tyr 

310 315 320 

2! £ aCa _ Ctt ??* tac tfct 9« -a ate get aaa 



1008 



1011 



FIG. 2B 



JSDOCID: <WO 0196381 A2J_> 



SUBSTITUTE SHEET (RULE 26) 



WO 01/96381 



PCT/CA01/00838 



' 5/34 



atg gta gtt aaa gtt ggt att aac ggt ttc ggt cgt ate gga cgt ctt 48 
Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 is 

gca ttc cgt cgt att caa aac gtt gaa ggt gtt gaa gta act cgt att 96 
Ala Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg lie 
20 25 30 

aac gat ctt act gac cca aat atg ctt gca cac ttg ttg aaa tat gat 144 
Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa ggt cgt ttc gac ggt aca gtt gaa gtt aaa gat ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
SO 55 60 

ttc gaa gtt aac gga aac ttc ate aaa gtt tct get gaa aaa gat cca 240 
Phe Glu Val Asn Gly Asn Phe lie Lys Val Ser Ala Glu Lys Asp Pro 
65 70 75 80 

gaa aac att gac tgg gca act gac ggt gta gaa ate gtt ctt gaa gca 2 88 
Glu Asn lie Asp Trp Ala Thr Asp Gly Val Glu lie Val Leu Glu Ala 
85 90 95 

act ggt ttc ttt get aaa aaa gca get get gaa aaa cat tta cat get 336 
Thr Gly Phe Phe Ala Lys Lys Ala Ala Ala Glu Lys His Leu His Ala 
100 105 110 

aac ggt get aaa aaa gtt gtt ate aca get cct ggt gga gat gat gtt 384 
Asn Gly Ala Lys Lys Val Val lie Thr Ala Pro Gly Gly Asp Asp Val 
115 120 125 

aaa act gtt gta ttt aac aca aac cat gac att ctt gac ggt aca gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

act gta att tea ggt get tea tgt act act aac tgt tta get cca atg 480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 

150 155 160 

get aaa get ttg caa gat aac ttt ggt gtt aaa caa ggt ttg atg aca 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 175 

act ate cac get tac act ggt gac caa atg ate ctt gac gga cca cac 576 
Thr He His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gac ctt cgt cgt get cgt get ggt gca age aac att gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ser Asn He Val 
195 200 205 
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cct aac tea act ggt get get aaa gca ate ggt ctt gta ate cca gaa 672 
Pro Asn Ser Thr Gly Ala Ala Lys Ala lie Gly Leu Val lie Pro Glu 
210 215 220 

tta aat ggt aaa ctt gac ggt get gca caa cgt gtt cct gtt cca act 720 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

gga tea gta act gaa tta gta gca gtt ctt gaa aaa gaa act tea gtt 768 
Gly Ser Val Thr Glu Leu Val Ala Val Leu Glu Lys Glu Thr Ser Val 
245 250 " 255 

gaa gaa ate aac gca gca atg aaa gca get gca aac gat tea tac gga 816 
Glu Glu He Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

tac act gaa gac cca ate gta tct tct gat ate ate ggt atg get tac 864 
Tyr Thr Glu Asp Pro He Val Ser Ser Asp He He Gly Met Ala Tyr 
275 280 ■ ' 285 

ggt tea ttg ttt gat get act caa act aaa gta caa act gtt gat gga 912 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 
290 295 300 

aat caa tta gtt aaa gtt gtt tea tgg tat gac aac gaa atg tct tac 960 
Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 

act gca caa ctt gtt cgt act ctt gag tac ttt gca aaa ate get aaa 1008 
Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335 

taa 1011 
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atg gta gtt aaa gtt ggt att aac ggt ttt ggc cgt ate gga cgt ctt 48 

Met Val Val Lys Val Gly He Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 15 

get ttc cgt cgt att caa aat gta gaa ggt gtt gaa gtt act cgc ate 96 
Ala Phe Arg Arg lie . Gin Asn Val Glu Gly Val Glu Val Thr Arg lie 
20 25 30 



aac gac ctt aca gat cca aat atg ctt gca cac ttg tta aaa tac gat 
Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 



ttt gac gtt aac gga aaa ttc att aaa gtt tct get gaa aaa gat cca 
Phe Asp Val Asn Gly Lys Phe lie Lys Val Ser Ala Glu Lys Asp Pro 



65 70 75 



80 



aca gtt att tea ggt get tea tgt act aca aac tgt tta get cca atg 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala . Pro Met 
145 150 155 160 
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aca act caa ggt cgt ttt gac ggt act gta gaa gtt aaa gat ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 



240 



gaa caa att gac tgg gca act gac ggt gtt gaa ate gtt ctt gaa gca 288 
Glu Gin He Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala 
85 go 95 

act ggt ttc ttt get aaa aaa gca get get gaa aaa cat tta cat gaa 336 
Thr Gly Phe Phe Ala Lys Lys Ala Ala Ala Glu Lys His Leu His Glu 
100 105 no 

aat ggt get aaa aaa gtt gtt ate act get cct ggt gga gat gac gtg 3 84 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asp Asp Val 
115 120 125 

aaa aca gtt gta ttt aac act aac cat gat ate ctt gat gga act gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
"0 135 140 



480 



get aaa get tta caa gat aac ttt ggc gta aaa caa ggt tta atg act 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 175 

aca ate cac get tac act ggt gat caa atg ctt ctt gat gga cct cac 576 
Thr He His Ala Tyr Thr Gly Asp Gin Met Leu Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gac tta cgt cgt gee cgt get ggt get aac aat att gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Asn Asn He Val 
195 200 205 
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cct aac tea act ggt get get aaa gca ate ggt ctt gtt ate cct gaa 672 
Pro Asn Ser Thr Gly Ala Ala Lys Ala lie Gly Leu Val lie Pro Glu 
210 215 220 

tta aat ggt aaa ctt gac ggt get gca caa cgt gta cca gtt cca aca 720 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

ggt tea gta aca gaa tta gta gca gtt ctt aat aaa gaa act tea gta 768 
Gly Ser Val Thr Glu Leu Val Ala Val Leu Asn Lys Glu Thr Ser Val 
245 250 255 

gaa gaa att aac tea gta atg aaa get gca get aat gat tea tat ggt 816 
Glu Glu lie Asn Ser Val Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

tac act gaa gat cca ate gta tea tct gat ate gtt ggt atg tct ttc 864 
Tyr Thr Glu Asp Pro lie Val Ser Ser Asp lie Val Gly Met Ser Phe 
275 280 285 

ggt tea tta ttc gat get act caa act aaa gta caa act gtt gat gga 912 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 
290 295 300 

aat caa tta gtt aaa gtt gtt tea tgg tat gac aat gaa atg tct tac 960 
Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 

act get caa ctt gat cgt aca ctt gag tac ttt gca aaa ate get aaa 1008 
Thr Ala Gin Leu Asp Arg Thr Leu Glu Tyr Phe Ala Lys lie Ala Lys 
325 330 335 
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atg gta gtt aaa gtt ggt att aac ggt ttc gga cgt ate ggt cgt ctt 
Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie GXy Arg Leu 
1 S io is 



48 



gca ttc cgt cgt att caa aat gtt gaa ggt gtt gaa gta act cgt ate 96 
Ala Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg lie 
20 25 30 

aat gac ctt aca gat cct aac atg ctt gca cac ttg ttg aaa tat gat 144 
Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa ggt cgt ttt gac ggt aca gtt gaa gtt aaa gat ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 

ttc gaa gtt aac gga age ttt gtt aaa gtt tct gca gaa cgc gaa cca 240 
Phe Glu Val Asn Gly Ser Phe Val Lys Val Ser Ala Glu Arg Glu Pro 
65 70 75 80 - 

gca aac att gac tgg get act gat ggt gta gac ate gtt ctt gaa gca 28 8 
Ala Asn lie Asp Trp Ala Thr Asp Gly Val Asp He Val Leu Glu Ala 
85 90 95 

aca ggt ttc ttc get tct aaa gca get get gaa caa cac att cac get 336 
Thr Gly Phe Phe Ala Ser Lys Ala Ala Ala Glu Gin His He His Ala 
100 105 110 

aac ggt gcg aaa aaa gtt gtt ate aca get cct ggt gga aat gac gtt 3 84 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

aaa aca gtt gtt tac aac act aac cat gat att ctt gat gga act gaa 432 
Lys Thr Val Val Tyr Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
I 30 135 140 

aca gtt ate tea ggt get tea tgt act aca aac tgt tta get cca atg 480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
I 45 150 155 160 

get aaa gca tta caa gat aac ttt ggt gta aaa caa ggt tta atg act 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 175 

act ate cat ggt tac act ggt gac caa atg gtt ctt gac gga cca cac 576 
Thr He His Gly Tyr Thr Gly Asp Gin Met Val Leu Asp Gly Pro His 
180 ias 190 

cgt ggt ggt gat ctt cgt cgt get cgt gca get gca gca aac ate gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Ala Ala Ala Asn He Val 
135 200 205 
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cct aac tea act ggt get get aaa gca ate ggt ctt gtt ate cca gaa 672 
Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
210 215 220 

tta aat ggt aaa ctt gac ggt get gca caa cgt gtt cct gtt cca act 720 
lieu Asn Gly Lys Leu Asp Gly Ala Aia Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

gga tea gta act gaa tta gta gca gtt ctt gaa aaa gat act tea gta 768 
Gly Ser Val Thr Glu Leu Val Ala Val Leu Glu Lys Asp Thr Ser Val 
245 250 255 

gaa gaa ate aat gca get atg aaa gca gca get aac gat tea tac ggt 816 
Glu Glu He Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

tac act gaa gat get ate gta tea tea gat ate gta ggt att tct tac 864 
Tyr Thr Glu Asp Ala He Val Ser Ser Asp He Val Gly He Ser Tyr 
275 280 285 

ggt tea tta ttt gat get act caa act aaa gta caa act gtt. gat gga 912 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val- Gin Thr Val Asp Gly 
290 295 300 

aat caa ttg gtt aaa gtt gtt tea tgg tat gac aat gaa atg tct tac 960 
Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 



act get caa ctt gtt cgt act ctt gag tac ttt gca aaa ate get aaa 1008 
Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335 

taa 1011 
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151 200 
DysGapC ACTCAAGGAC GTTTTGACGG AACTGTTGAA GTTAAAGAAG GTGGATTTGA 

SpyGapC 1-- a--a 1-- 

SeqGapC a- a--t t- - 

ParaUbGapC t--a 1- t-- 

UberGapc >- c a 1- c-- 

AgalGapCDNA c ---fc - _ c _ _ 
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DysGapC AGTAAACGGA AACTTCATCA AAGTTTCTGC TGAACGTGAT CCAGAAAACA 

SpyGapC a 1 

SeqGapC a 1 

ParaUbGapC c --a 1- aaa c-a- 

UberGapc aaa--- 

AgalGapCDNA 1 c-a--tg-t- c--a c- 

SiniGapC -g- -- tg-t- a c--a c 
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DysGapC TCGACTGGGC AACTGACGGT GTTGAAATCG TTCTGGAAGC AACTGGTTTC 

SpyGapC -c t--g 

SeqGapC -c --c 
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UberGapc -t c --a 
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DysGapC TTTGCTAAAA AAGAAGCTGC TGAAAAACAC TTACATGCTA ACGGTGCTAA. 

SpyGapC a -- 

SeqGapC c- 

ParaUbGapC c 1 aa- -t 

UberGapc c 1 . 

AgalGapCDNA ate-- aaa-- --g-c a-c- aa- -t ■ 

SiniGapC --c tct- c c a-t--c g-- 
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DysGapC AAAAGTTGTT ATCACAGCTC CTGGTGGAAA CGACGTTAAA ACAGTTGTTT 
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^JcipC TCAACACTAA CCAC <3A.CAT TCTTGACGGT ACTGAAACAG TTATCTCAGG 
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DysGapC TGCTTCATGT ACTACAAACT GTTTAGCTCC TATGGCTAAA GCTCTTCACG 
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551 600 

DysGapC GGTGACCAAA TGATCCTTGA CGGACCACAC CGTGGTGGTG ACCTTCGTCG 

SpyGapC 
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DysGapC AAGCTATCGG TCTTGTTATC CCAGAATTGA ATGGTAAACT TGATGGTGCT 
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DysGapC TCTTGATAAA AACGTTTCTG TTGACGAAAT CAACGCTGCT ATGAAAGCTG 
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<110> University of Saskatchewan 
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Per e z - Casal , Jose 
Fontaine, Michael 
Potter, Andrew A. 

<120> IMMUNIZATION OF pAIRY CATTLE WITH GapC PROTEIN AGAINST 
STREPTOCOCCUS INFECTION 

<13 0> 08-891S14WO 

<140> 
<141> 

<160> 12 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer gapCl 
<400> 1 

ggcggcggca tatggtagtt aaagttggta ttaacgg 37 

<210> 2 
<211> 35 

<212> DNA - 
<213> Artificial Sequence 

<220> 

<223> Description .of Artificial Sequence: primer gapClr 



<400> 2 

gcggatcctt atttagcgat ttttgcaaag tactc 

<210> 3 
<211> 1011 
<212> DNA 

<213> Streptococcus dysgalactiae 

<220> 

<221> CDS 

<222> (1) . . (1011) 

<400> 3 



35 
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atg gta gtt aaa gtt ggt att aac ggt ttc ggt cgt ate gga cgt ctt 
Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 * ■ 10 is 

gca ttc cgt cgt att caa aat gtt gaa ggt gtt gaa gta act cgt ate 
Ala Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 ' 



48 



96 



aac .gac ctt aca gat cca aac atg ctt gca cac ttg ttg aaa tac gat 144 
Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa gga cgt. ttt gac gga act gtt gaa gtt aaa gaa ggt gga 192 
Thr , Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly 
50 55 60 

ttt gaa gta aac gga aac ttc ate aaa gtt tct get gaa cgt gat cca - 240 
Phe Glu Val Asn Gly Asn Phe He Lys Val Ser Ala Glu Arg Asp. Pro 
65 70 75 80 

gaa aac ate gac tgg gca act gac ggt gtt gaa ate gtt ctg gaa gca 288 
Glu Asn He Asp Trp Ala Thr Asp Gly Val Glu lie Val Leu Glu Ala 
85 * 90 95 

act ggt ttc ttt get aaa aaa gaa get get gaa aaa cac tta cat get 336 
Thr Gly Phe Phe Ala Lys Lys Glu Ala Ala Glu Lys His Leu His Ala 
100 . 105 iio 

aac ggt get aaa aaa gtt gtt ate aca get cct ggt gga aac gac gtt 384 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val 
115 12 0 125 

aaa aca gtt gtt ttc aac act aac cac gac att ctt gac ggt act gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

aca gtt ate tea ggt get tea tgt act aca aac tgt tta get cct atg 480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 150 155 160 

get aaa get ctt cac gat gca ttt ggt ate caa aaa ggt ctt atg act 528 
Ala Lys Ala Leu His Asp Ala Phe Gly He Gin Lys Gly Leu Met Thr 
165 170 175 

aca ate cac get tat act ggt gac caa atg ate ctt gac gga cca cac 576 
Thr. lie His Ala Tyr Thr Gly Asp Gin Met lie Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gac ctt cgt cgt get cgt get ggt get gca aac att gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val 
195 200 205 



cct aac tea act ggt get get aaa " get ate ggt ctt gtt ate cca gaa . 672 
Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
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220 



215 



220 



ttg aat ggt aaa ctt gat ggt get gca caa cgt gtt cct gtt cca act 720 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

gga tea gta act gag ttg gtt gta act ctt gat aaa aac gtt tct gtt 768 
. Gly Ser Val Thr Glu Leu Val Val Thr Leu Asp Lys Asn Val Ser Val 
245 250 255 

gac gaa ate aac get get atg aaa get get tea aac gac agt ttc ggt 816 
Asp Glu lie Asn Ala Ala Met Lys Ala Ala Ser Asn Asp Ser Phe Gly 
260 265 270 

tac act gaa gat cca att gtt tct tea gat ate gta ggc gtg tea tac 864 
Tyr Thr Glu Asp Pro lie Val Ser Ser Asp lie Val Gly Val Ser Tyr 
275 280 285 



ggt tea ttg ttt gac gca act caa act aaa gtt atg gaa gtt gac gga 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Met Glu Val Asp Gly 
290 295 300 



912 



tea caa ttg gtt aaa gtt gta tea tgg tat gac aat gaa atg tct tac 
Ser Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 



960 



act get caa ctt gtt cgt aca ctt gag tac ttt gca aaa ate get aaa 
Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 • 335 



1008 



taa 



1011 



<210> 4 
<211> 336 
<212> PRT 

<213> Streptococcus . dysgalactiae 
<400> 4 

Met Val Val Lys Val Gly He Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 15 

Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 - 

Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly 
50 55 60 

Phe Glu Val Asn Gly Asn Phe lie Lys Val Ser Ala Glu Arg Asp Pro 
65 70 75 80 



Glu Asn He Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala 
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Thr Gly Phe Phe Ala Lys Lys Glu Ala Ala Glu Lys His Leu His Ala 
100 105 no 

Asn Gly Ala Lys Lys Val Val lie Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

Lys Thr Val Val Phe Asn Thr Asn His Asp lie Leu Asp Gly Thr Glu 
130 135 ' 140 

Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
14 5 150 155 160 

Ala Lys Ala Leu His Asp Ala Phe Gly He Gin Lys Gly Leu Met Thr 
165 170 175 

Thr He His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro His 
180 185 190 

Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val 
195 200 205 

Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
210 215 220 

Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 

225 230 235 .. 240 

Gly Ser Val Thr Glu Leu Val Val Thr Leu Asp. Lys Asn Val Ser Val 
245 250 255 

Asp Glu He Asn Ala Ala Met Lys Ala Ala Ser Asn Asp Ser Phe Gly 
260 265 270 

Tyr Thr Glu Asp Pro lie Val Ser Ser Asp lie Val Gly Val Ser Tyr 
275 280 285 

Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Met Glu Val Asp Gly 
290 295 300 

Ser Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 ".■ 315 320 

Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335 



<210> 5 
<211> 1011 
<212> DNA 

<213> Streptococcus agalactiae 

<220> 

<221> CDS 

<222> (1) . . (1011) 
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<400> 5 

atg gta gtt aaa gtt ggt att aac ggt ttc ggt cgt ate ggt cgt ctt 4 8 

Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 

1 5 10 15 

gca ttc cgt cgc ate caa aac gta gaa ggt gtt gaa gtt act cgt ate 96 
Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 

aac gac ctt aca gat cca aac atg ctt gca cac ttg ttg aaa tat 1 gac 144 
Asn Asp Leu. Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa ggt cgt ttc gac ggt act gtt gaa' gtt aaa gaa ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly 
50 55 60 

ttc gaa gtt aac ggt caa ttt gtt aaa gtt tct get gaa cgc gaa cca 240 
Phe Glu Val Asn Gly Gin Phe Val Lys Val Ser Ala Glu Arg Glu Pro 
65 w 70 75 80 

gca. aac att gac tgg get , act gat ggc gta gaa ate gtt ctt gaa gca 2 88 
Ala Asn lie Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala 
85 90 95 

act ggt ttc ttt gca tea aaa gaa aaa get gga caa cac ate cat gaa 336 
Thr Gly Phe Phe Ala Ser Lys Glu Lys Ala Gly Gin His He His Glu 
100 105 110 

aat ggt get aaa aaa gtt gtt ate aca get cct ggt gga aac gac gtt 384 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

aaa aca- gtt gtt ttc aac act aac cac gat ate ctt gat gga act gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

aca gtt ate tea ggt get tea tgt act aca aac tgt ctt get cca atg 480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys lieu Ala Pro Met 
145 150 155 160 

get aaa get tta caa gac aac ttt ggt gtt aaa caa ggt ttg atg act 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 ~ 175 



act ate cac gca tac act ggt gac caa atg ate ctt gac gga cca cac 576 
Thr lie His Ala Tyr Thr Gly Asp Gin Met lie Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gac ctt cgt cgt get cgt gca ggt get gca aac ate gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val 
195 200 205 

cct aac tea act ggt get gca aaa get ate gga ctt gtt ate cca gaa 672 
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Pro Asn Ser Thr Gly Ala Ala Lys Ala lie Gly Leu Val He Pro Glu 
210 215 220 

ttg aac ggt aaa ctt gat ggt get gca caa cgt gtt cct gtt cca act 72 0 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 2 30 235 2 40 

gga tea gta act gaa ttg gtt gca act ctt gaa aaa gac gta act gtc 768 
Gly Ser Val Thr Glu Leu Val Ala Thr Leu Glu Lys Asp Val Thr Val 
245 250 " 255 

gaa gaa gta aat gca get atg aaa gca gca get aac gat tea tac ggt 816 
Glu Glu Val Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

tat act gaa gat cca ate gta tea tct gat ate gtt ggt att tea tac 864 
Tyr Thr Glu Asp Pro He Val Ser Ser- Asp He Val Gly He Ser Tyr 
275 280 285 

ggt tea ttg ttt gat get act caa act aaa gtt caa act gtt gac ggt 912 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 
2^0 295 .300 

aac caa ttg gtt aaa gtt gtt tea tgg tac gat aac gaa atg tea tac 
Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 



960 



act tea caa ctt gtt cgt aca ctt gag tac t-tt gca aaa ate get aaa 1008 

Thr Ser Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys lie Ala Lys 

325 330 335 

taa 1011 



<210> 6 
-<211> 336 
<212> PRT 

<213> Streptococcus agalactiae 
<400> 6 

Met Val Val Lys Val Gly He Asn Gly Phe Gly Arg He Gly * Arg Leu 
1 • 5 10 15 



Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 

Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 

35 * 40 . ' 45 

Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Glu Gly Gly 
50 55 60 

Phe Glu Val Asn Gly Gin Phe Val Lys Val Ser Ala Glu Arg Glu Pro 
65 70 75 80 
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Ala Asn He Asp Trp Ala Thr Asp Gly Val Glu lie Val Leu Glu Ala 
85 90 95 

Thr Gly Phe Phe Ala Ser Lys Glu Lys Ala Gly Gin His He His Glu 
100 105 i 10 

Asn Gly Ala Lys Lys Val Val lie Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

Thr Val lie Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 



145 150 155 



ISO 



Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
155 ivo 175 

Thr He His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly p ro His 
180 185 i 9 o 

Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ala Asn He Val 
195 200 205 

Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
21° 215 220 

Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

Gly ser Val Thr Glu Leu Val Ala Thr Leu Glu Lys Asp Val Thr Val 
245 250 255 

Glu Glu Val Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

Tyr Thr Glu Asp Pro He Val Ser Ser Asp He Val Gly He Ser Tyr 
275 280 285 

Gly ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 
290 295 300 

Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 . 310 315 320 

Thr Ser Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335- 



<210> 7 
<211> 1011 
<212> DNA 

<213> Streptococcus uberis 

<220> 
<221> CDS 
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<222> (1) . . (1011) 



<400> 7 



atg gta gtt aaa gtt ggt att aac ggt ttc ggt. cgt ate gga cgt ctt 48 
Met Val Val Lys Val Gly He Asn Gly Phe Gly Arg He Gly Arg Leu 
1 5 10 15 

gca ttc cgt cgt att caa aac gtt gaa ggt gtt gaa gta act cgt att 96 
Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg lie 
20 25 30 

aac gat ctt act gac cca aat atg ctt gca cac ttg ttg aaa tat gat 144 
Asn' Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 



aca act caa ggt cgt ttc gac ggt aca gtt gaa gtt aaa gat ggt gga 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 
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ttc gaa gtt aac gga aac ttc ate aaa gtt tct get gaa aaa gat cca 240 
Phe Glu Val Asn Gly Asn Phe lie Lys Val Ser Ala Glu Lys Asp Pro 
65 70 75 '80 

gaa aac att gac tgg gca act gac ggt gta gaa ate gtt ctt gaa gca 2 88 
Glu Asn He Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala 
85 90. 95 

act ggt . ttc ttt get aaa aaa gca get get gaa aaa cat tta cat get 336 
Thr Gly Phe Phe Ala' Lys Lys Ala Ala Ala Glu Lys His Leu His Ala 
100 105 no 

aac ggt get aaa aaa gtt gtt ate aca get cct ggt gga gat gat gtt 384 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asp Asp Val 
115 120 125 

aaa act gtt gta ttt aac aca aac cat gac att ctt gac ggt aca gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 



act gta att tea ggt get tca.tgt act act aac tgt tta get cca atg 480 

Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 150 155 160 

get aaa get ttg caa gat aac ttt ggt gtt aaa caa ggt ttg atg aca 528 

Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 

165 170 175 

act ate cac get tac act ggt gac caa atg ate ctt gac gga cca cac 576 

Thr He His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro His 
180 185 . 190 

cgt ggt ggt gac ctt cgt cgt get cgt get ggt gca age aac att gtt 624 

Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ser Asn He Val 
195 200 205 



4SDOCID: <WO 0196361 A2_l_> 



WO 01/96381 



PCT/CA01/00838 



cct aac tea act ggt get get aaa gca ate ggt ctt gta ate cca gaa 672 
Pro Asn Ser Thr Gly Ala Ala Lys Ala lie Gly Leu Val He Pro Glu 
210 215 220 

tta aat ggt aaa ctt gac ggt get gca caa cgt gtt cct gtt cca act 72 0 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

gga tea gta act gaa tta gta gca gtt ctt gaa aaa gaa act tea gtt 768 
Gly Ser Val Thr Glu Leu Val Ala Val Leu Glu Lys Glu Thr Ser Val 
245 250 255 

gaa gaa ate aac gca gca atg aaa gca get gca aac gat tea tac gga 816 
Glu Glu lie Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

tac act gaa gac cca ate gta tct tct gat ate ate ggt atg get tac 864 
Tyr Thr Glu Asp Pro He Val Ser Ser Asp He He Gly Met Ala Tyr 
275 280 285 

ggt tea ttg ttt gat get act caa act aaa gta caa act gtt gat gga 912 
Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 
290 . 1 295 , 300 

aat caa tta gtt aaa gtt gtt tea tgg tat gac aac gaa atg tct tac 960 
Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 

act gca caa ctt gtt cgt act ctt gag tac ttt gca aaa ate get aaa 1008 
Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335 



taa 



<210> 8 

<211> 336 

<212> PRT 

<213> Streptococcus uberis 

<400> 8 

Met Val Val Lys Val Gly He Asn Gly Phe Gly Arg He Gly Arg Leu 
1 5 10 15 

Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 

Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

Thr Thr Gin Gly Arg Phe Asp Gly Thr Val. Glu Val Lys Asp Gly Gly 
50 55 60 

Phe Glu Val Asn Gly Asn Phe He Lys Val Ser Ala Glu Lys Asp Pro 
65 70 75 80 



1011 
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Glu Asn He Asp Trp Ala Thr Asp .Gly Val Glu lie Val Leu Glu Ala 
85 90 95 

Thr Gly Phe Phe Ala Lys Lys Ala Ala Ala Glu Lys His Leu His Ala 
100 105 no 

Asn Gly Ala Lys Lys Val Val lie Thr Ala Pro Gly Gly Asp Asp Val 
H5 120 125 

Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 150 155 160 

Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 175 

Thr He His Ala Tyr Thr Gly Asp Gin Met He Leu Asp Gly Pro His 
180 185 190 

Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Ser Asn He Val 
135 200 2,05 

Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
21° 215 220 

Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

Gly Ser Val Thr Glu .Leu Val Ala Val Leu Glu Lys Glu Thr Ser Val 
245 250 255 

Glu Glu He Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

Tyr Thr Glu Asp' Pro He Val Ser Ser Asp He He Gly Met Ala Tyr 
275 280 285 

Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 
290 295 300 

Asn Gin Leu Val LJns Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 320 

Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335 



<210> 9 
<211> 1011 
<212> DNA 

<213> Streptococcus parauberis 

<220> 
<221> CDS 
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<222> (1) . . (1011) 
<400> 9 

atg gta gtt aaa gtt ggt att aac ggt ttt ggc cgt ate gga cgt ctt 48 
Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 15 

get ttc cgt cgt att caa aat gta gaa ggt gtt gaa gtt act cgc ate . 96 
Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 

aac gac ctt aca gat cca aat atg ctt gca cac ttg tta aaa tac gat 144 
Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

aca act caa ggt cgt ttt gac ggt act gta gaa gtt aaa gat ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 

ttt gac gtt aac gga aaa ttc att aaa gtt tct get gaa aaa gat cca , 240 
Phe Asp Val Asn Gly Lys Phe He Lys Val Ser Ala Glu Lys Asp Pro 
65 70 75 80 

gaa caa att gac tgg gca act gac ggt gtt gaa ate gtt ctt gaa gca 288 
Glu Gin lie Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala 
85 90 95 

act ggt ttc ttt get aaa aaa gca get get gaa aaa cat tta cat gaa 336 
Thr Gly Phe Phe Ala Lys Lys. Ala Ala Ala Glu Lys His Leu His Glu 
100 105 110 

aat ggt get aaa aaa gtt gtt ate act get cct ggt gga gat gac gtg 384 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asp Asp Val 
115 120 125 

aaa aca gtt gta ttt aac act aac cat gat ate ctt gat gga act gaa 432 
Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

aca gtt att tea ggt get. tea tgt act aca aac tgt tta get cca atg .480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 150 155 160 

get aaa get tta caa gat aac ttt ggc gta aaa caa ggt tta atg act 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 175 

aca ate cac get tac act ggt gat caa atg ctt ctt gat gga cct cac 576 
Thr He His Ala Tyr Thr Gly Asp Gin Met Leu Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gac tta cgt cgt gec cgt get ggt get aac aat att gtt 624 
Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Gly Ala Asn Asn He Vai 
195 200 205 

cct aac tea act ggt get .get aaa gca ate ggt ctt gtt ate cct gaa 672 
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Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
210 215 220 

tta aat ggt aaa ctt gac ggt get gca caa cgt gta cca gtt cca aca 
Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

ggt tea gta aca gaa tta gta gca gtt ctt aat aaa gaa act tea gta 
Gly Ser Val Thr Glu Leu Val Ala Val Leu Asn Lys Glu Thr Ser Val 
245 250 255 



tac act gaa gat cca ate gta tea .tct gat ate gtt ggt atg tct ttc 
Tyr Thr Glu Asp Pro lie Val Ser Ser Asp He Val Gly Met Ser Phe 
275 280 2 85 



720 



768 



gaa gaa att aac tea gta atig aaa get gca get aat gat tea tat ggt 816 
Glu Glu He Asn Ser Val Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 



864 



ggt tea tta ttc gat get act caa act aaa gta caa act gtt gat gga 912 

Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 

290 295 300 

aat caa tta gtt aaa gtt gtt tea tgg tat gac aat gaa atg tct tac 960 

Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 

3 05 310 .315 320 

act get caa ctt gat cgt aca ctt gag tac ttt gca aaa ate get aaa 1008 

Thr Ala Gin Leu Asp Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 

325 330 335 



taa 



1011 



<210> 10 - 
<211> 336 
<212> PRT 

<213> Streptococcus parauberis 
<400> 10 

Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg He Gly Arg Leu 
1 5 . 10 15 

Ala Phe Arg Arg He Gin Asn Val Glu Gly Val' Glu Val Thr Arg He 

20 25 . . 30 

Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 

Phe Asp Val Asn Gly Lys Phe He Lys Val Ser Ala Glu Lys Asp Pro 
65 70 75 80 
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■Glu Gin lie Asp Trp Ala Thr Asp Gly Val Glu He Val Leu Glu Ala 



85 SO 



95 



Thr Gly Phe Phe Ala Lys Lys Ala Ala Ala Glu Lys His Leu His Glu 
100 105 110 

Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asp Asp Val 
115 120 12S 

Lys Thr Val Val Phe Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

Thr val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 150 155 1S0 

Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 .170 175 

Thr He His Ala Tyr Thr Gly Asp Gin Met Leu Leu Asp Gly Pro His 
180 185 190 

Arg Gly Gly Asp Leu Arg Arg "Ala Arg Ala Gly Ala Asn Asn He Val 
195 200 205 

Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 
210 215 220 

1 

Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 
225 230 235 240 

Gly Ser Val Thr Glu Leu Val Ala Val Leu Asn Lys Glu Thr Ser Val 
245 250 255 

Glu Glu lie Asn Ser Val Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 
260 265 270 

• Tyr Thr Glu Asp Pro He Val Ser Ser Asp lie Val Gly Met Ser Phe 
275 280 285 

Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Glv 
290 295 300 

Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 
305 310 315 . 320 

Thr Ala Gin. Leu Asp Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 
325 330 335 

<210> 11 
<211> 1011 
<212> DNA 

<213> Streptococcus iniae 

<220> 
<221> CDS 
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<222> (1) . m (1011) 
<400> 11 

atg gta gtt aaa gtt ggt att aac ggt ttc gga cgt ate ggt cgt ctt 48 

Met Val Val Lys Val Gly lie Asn Gly Phe Gly Arg lie Gly Arg Leu 
1 5 10 15 



gca ttc cgt cgt att caa aat gtt gaa ggt gtt gaa gta act cgt ate 
Ala Phe Arg Arg lie Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 



96 



aat gac ctt aca gat cct aac atg ctt -gca cac ttg ttg aaa tat gat 144 
Asn Asp Leu Thr^Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 . 45 

aca act caa ggt cgt ttt gac ggt aca gtt gaa gtt aaa gat ggt gga 192 
Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 

ttc gaa gtt aac gga age ttt gtt aaa gtt tct gca gaa cgc gaa cca 24 0 
Phe Glu Val Asn Gly Ser Phe Val Lys Val Ser Ala Glu Arg Glu Pro 
65 70 75 80 

gca aac att gac tgg get act gat ggt gta gac ate gtt ctt gaa gca 2 88 
Ala Asn He Asp Trp Ala Thr Asp Gly Val Asp He Val Leu Glu Ala 
85 90 95 

aca ggt ttc ttc get tct aaa gca get get gaa caa cac att cac get 336 
Thr Gly Phe Phe Ala Ser Lys Ala Ala Ala Glu Gin His He His Ala 
100 105 no 

aac ggt gcg aaa aaa gtt gtt ate aca get cct ggt g : ga aat gac gtt 384. 
Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

aaa aca gtt gtt tac aac act aac cat gat att ctt gat gga act gaa 432 
Lys Thr Val Val Tyr Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 ' 140 

aca gtt ate tea ggt get tea tgt act aca aac tgt tta get cca atg 480 
Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro. Met 
145. 150 155 160 

get aaa gca tta caa gat aac ttt ggt gta aaa caa ggt tta atg act 528 
Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 175 

act ate cat ggt tac act ggt gac caa atg gtt ctt gac gga cca cac 576 
Thr He His Gly Tyr Thr Gly Asp . Gin Met Val Leu Asp Gly Pro His 
180 185 190 

cgt ggt ggt gat ctt cgt cgt get cgt gca get gca gca aac ate gtt 62*4 
Arg Gly Gly Asp- Leu Arg Arg Ala Arg Ala Ala Ala Ala Asn He Val 
195 200 205 

cct aac tea act ggt get get aaa gca ate ggt ctt gtt ate cca gaa 6 72 
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Pro Asn Ser Thr Gly Ala Ala Lys Ala 
210 215 

tta aat ggt aaa ctt gac ggt get gca 
v Leu Asn Gly Lys Leu Asp Gly Ala Ala 
225 230 

gga tea gta act gaa tta gta gca gtt 
Gly Ser Val Thr Glu Leu Val Ala Val 
245 

gaa gaa ate aat gca get atg aaa gca 
Glu Glu lie Asn Ala Ala Met Lys Ala 
260 265 

tac act gaa gat get ate gta tea tea 
Tyr Thr Glu Asp Ala lie Val Ser Ser 
275 280 

ggt tea tta ttt gat get act caa act 
Gly Ser Leu Phe Asp Ala Thr Gin Thr 
290 " 295 

aat caa ttg gtt aaa gtt gtt tea tgg 
Asn Gin Leu Val Lys Val Val Ser Trp 
305 310 



act get caa ctt gtt cgt act ctt gag 
Thr Ala Gin Leu Val Arg Thr Leu Glu 
325 

taa 



He Gly Leu Val He Pro Glu 
220 

caa cgt gtt cct gtt cca act 720 
Gin Arg Val Pro Val Pro Thr. 
235 240 

ctt gaa aaa gat act tea gta 768 
Leu Glu Lys Asp Thr Ser Val 
250 25.5 

gca get aac.gat tea tac ggt 816 
Ala Ala Asn Asp Ser Tyr Gly 
270 

gat ate gta ggt att tct tac 864 
Asp He Val Gly He Ser Tyr 
285 

aaa gta caa act gtt gat gga 912 
Lys Val Gin Thr Val Asp Gly 
300 

tat gac aat gaa atg tct tac 960 
Tyr Asp Asn Glu Met Ser Tyr 
315 320 



tac ttt gca aaa ate get aaa 1008 
Tyr Phe Ala Lys He Ala Lys 
330 335 

1011 



<210> 12 
<211> 336 

<212> PRT . * - - ■ 

<213> Streptococcus iniae 

i ■ 

<400> 12 

Met Val Val Lys Val Gly He Asn Gly Phe Gly Arg He Gly Arg Leu 
1 5 10 15 

Ala Phe Arg Arg He Gin Asn Val Glu Gly Val Glu Val Thr Arg He 
20 25 30 

Asn Asp Leu Thr Asp Pro Asn Met Leu Ala His Leu Leu Lys Tyr Asp 
35 40 45 

Thr Thr Gin Gly Arg Phe Asp Gly Thr Val Glu Val Lys Asp Gly Gly 
50 55 60 

Phe Glu Val Asn Gly Ser Phe Val Lys Val Ser Ala Glu Arg Glu Pro 
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65 70 75 



80 



Ala Asn lie Asp Trp Ala Thr Asp Gly Val Asp He Val Leu Glu Ala 
85 so 95 

Thr Gly Phe Phe Ala Ser Lys Ala Ala Ala Glu Gin His He His Ala 
100 105 no 

Asn Gly Ala Lys Lys Val Val He Thr Ala Pro Gly Gly Asn Asp Val 
115 120 125 

Lys Thr Val Val Tyr Asn Thr Asn His Asp He Leu Asp Gly Thr Glu 
130 135 140 

Thr Val He Ser Gly Ala Ser Cys Thr Thr Asn Cys Leu Ala Pro Met 
145 150 .155 160 

\ 

Ala Lys Ala Leu Gin Asp Asn Phe Gly Val Lys Gin Gly Leu Met Thr 
165 170 i7 5 

Thr He ,His Gly Tyr Thr Gly Asp Gin Met Val Leu Asp Gly Pro His 
180 185 190 

Arg Gly Gly Asp Leu Arg Arg Ala Arg Ala Ala Ala Ala Asn He Val 
195 200 205 



Pro Asn Ser Thr Gly Ala Ala Lys Ala He Gly Leu Val He Pro Glu 

210 215 220 

Leu Asn Gly Lys Leu Asp Gly Ala Ala Gin Arg Val Pro Val Pro Thr 

225 230 235 ' 240 

Gly Ser Val Thr Glu Leu Val Ala Val Leu Glu Lys Asp Thr Ser Val 

245 250 255 

Glu Glu He Asn Ala Ala Met Lys Ala Ala Ala Asn Asp Ser Tyr Gly 

260 265 270 

Tyr Thr Glu Asp Ala He Val Ser Ser Asp He Val Gly He Ser Tyr 

275 280 285 

Gly Ser Leu Phe Asp Ala Thr Gin Thr Lys Val Gin Thr Val Asp Gly 

290 295 300 

Asn Gin Leu Val Lys Val Val Ser Trp Tyr Asp Asn Glu Met Ser Tyr 

305 310 315 , 320 

Thr Ala Gin Leu Val Arg Thr Leu Glu Tyr Phe Ala Lys He Ala Lys 

325 330 335 
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